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ABSTRACT 

We explore the properties of the submillijansky radio population at 20 cm by applying a newly 
developed optical color-based method to separate star forming (SF) from AGN galaxies at intermediate 
redshifts (z < 1.3). Although optical rest-frame colors are used, our separation method is shown to 
be efficient, and not biased against dusty starburst galaxies. This classification method has been 
calibrated and tested on a local radio selected optical sample. Given accurate multi-band photometry 
and redshifts, it carries the potential to be generally applicable to any galaxy sample where SF and 
AGN galaxies are the two dominant populations. 

In order to quantify the properties of the submillijansky radio population, we have analyzed ~ 2, 400 
radio sources, detected at 20 cm in the VLA-COSMOS survey. 90% of these have submillijansky flux 
densities. We classify the objects into 1) star candidates, 2) quasi stellar objects, 3) AGN, 4) SF, and 5) 
high redshift [z > 1.3) galaxies. We find, for the composition of the submillijansky radio population, 
that SF galaxies are not the dominant population at submillijansky flux levels, as previously often 
assumed, but that they make up an approximately constant fraction of 30 — 40% in the flux density 
range of ~ 50 \iJy to 0.7 mJy. In summary, based on the entire VLA-COSMOS radio population at 
20 cm, we find that the radio population at these flux densities is a mixture of roughly 30 — 40% of 
SF and 50 - 60% of AGN galaxies, with a minor contribution (~ 10%) of QSOs. 

Subject headings: Galaxies: surveys - Cosmology: observations - Radio continuum: galaxies 



1. INTRODUCTION 

The most straight-forward information that can be 
derived from extra-galactic radio sky surveys are 
the radio source counts, which have bee n exten- 
sively studied in the last t hree decades (ICondonl 
1984 IWindhorst et~ai1 119851: IGruppioni et alj 119991: 
Seymour et alJl2004 ISimpson et al.ll2006h . If space was 
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Euclidian, and there was no cosmic evolution of radio 
sources, then the differential source counts would fol- 
low a power law with exponent of 2.5 (see e.g. iPetersonl 
|1997[) . Hence, the observed slope (and the change of the 
slope) of the radio source counts in different flux density 
ranges provides an insight, although quite indirect, into 
the global properties of extra-galactic radio sources, and 
their cosmic evolution. Past studies have shown that 
at 1.4 GHz flux densities above ~ 100 mJy the source 
counts are dominated by 'radio-loud' AGN with lumi- 
nosities above the Fa naroff fc Riley (197 4; FR) break 
(~ 2 X 10 25 W Hz" 1 ; iWillott et al1l2002h . Decreasing 
from about 100 mJy to 1 mJy, the sour ce counts follow 
a power law (e.g. IWindhorst et al.lll985f) . and are mostly 
made up of 'radio-loud' objects with luminosities below 
the FR break (FR Class I sources). However, the dif- 
ferential source counts change their slope again, i.e. they 
flatten below 1 mJy, and these sub-mJy radio sources 
have often been interpreted as a rising new population of 
objects, which doe s not contribut e significantly at higher 
flux densities (e.g. ICondonlll984l ). 

To date the exact composition of this faint radio pop- 
ulation (hereafter 'population mix') i s not well deter- 
mined , and it is rather controversial. IWindhors t et al.l 
(1985f) suggested that the majority of sub-mJy radio 
sources are faint blue galaxies, presumably undergoing 
significant star forma t ion. Optical spectroscopy, ob- 
tained by iBenn et alj (|1993f) . supported this idea, and 
the source counts at faint levels were successfully mod- 
eled with a po pulation of intermedi ate-redshift star form- 
ing galaxies (ISevmour et al.l 12004). Ho wever, spectro- 
scopic results by IGruppioni et al.l(|1999l ) suggested that 
early-type galaxies were the dominant population at 
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sub-mjy levels. Further, it was recently suggested and 
modeled that the flattening of the source counts may 
be caused by 'radio-quiet' AGN (radio-quiet quasars 
and type 2 AGN), rath er than star forming galaxies 
(jJarvis k, R awlings 200- j); observ ations support this in- 
terpretation ([Simpson et al.ll2006f ) . Based from the com- 
bination of optical and radi o morphology as an ide ntifier 
for AGN and SF galaxies, iFomalont etail (|2006f ) sug- 
gested that at most 40% of t he sub-mjy rad i o sou rces 
are comprised of AGN, while iPadovani et al.l (|2007l ) in- 
dicated that this fraction may be 60 — 80% (the latter 
based their SF/AGN classification on a combination of 
optical morphologies, X-ray luminosities, and radio-to- 
optical flux density ratios of their radio sources). 

Two main reasons exist for such discrepant results. 
First, the identification fraction of radio sources with op- 
tical counterparts, which is generally taken to be repre- 
sentative of the full radio population, spans a wide range 
in literature (20% to 90%) depending on the depth of 
both the available radio and optical data, as well as the 
passband used (e.g. a larger fraction of radio sources are 
associated with NIR than optical data; see Sec. 13. 2[) . 
Second, the methods that were used to separate AGN 
from SF galaxies have been very heterogeneous in the 
past, ranging from pure radio luminosity or morphology 
cuts, through observed color properties to optical spec- 
troscopy. 

The two main populations of radio sources in deep 
radio surveys at 1.4 GHz (20 cm) ar e active galac- 
tic nuclei and star f orming galaxies (jCondonl 119841 : 
IWindhorst et al.l [1985). At this frequency the radio 
emission predominantly arises from synchrotron emis- 
sion powered either by accretion onto the central super- 
mass ive black hole (SMBH) or by supernovae remnants 
(e.g. ICondonl |1992| . note that both mechanisms may be 
at work in a given galaxy). It was shown that radio 
properties such as the distributions of mono-chromatic 
luminosities of SF and AGN galaxies (Seyferts, LINERs) 
are comparable a nd overlapping (at least locally; e.g. 
I Sadler et a l. 1999). Hence, in order to disentangle SF 
and AGN galaxies in the radio regime, observations at 
other wavelengths are required. 

Studies of extra-galactic radio sources in the local uni- 
verse (z < 0.3) have been invigorated due to the re- 
cent advent of panchromatic photometri c and spectro- 
scopi c all-sky su rveys, such as e.g. N VSS (I Condon et al. l 
1998j) . FIRS T dBecker et al.l jl995h , SPS S (lYorkeTafl 
' 2000L IRAS (jBeichman et all 119851 ). 2dF (jColless et al.l 
20011) w hich provide addition al panchromatic photomet- 



ric (e.g. [Simpson ct al. 200(f) a nd/or optical-IR s pectro- 
scopic (e.g. ISadler et alJ 119991 : iBest et al.H2005al ) obser 
vations. For example, the panchromat ic properties of ra- 
dio sources were s tudied to full detail (jlvezic et al.ll2002l : 
lObric et al.l 120061 ) . as well as the environmental d epen- 
dence of radio luminous AGN and SF galaxies dBestl 
20041). and their luminosi t y function ([Sadler et al.1ll99 



zUU4 ) , and tneir luminosi t y tunction i i baaier et aiJUMMM: 
Jackson fc Londishl 120001 : IChan et all l2004t IBest et al l 
2005a). Further, radio emission as a star formation 



universe (i.e. the global star formation rate per unit co- 
moving volume as a function of redshift) was not deter- 
mined with a high accuracy using radio data (see e.g. 
lHaarsma et al.l [20001 : Hlopkins 2004), the radio luminos- 
ity function for SF and AGN galaxies at z > 0.3 is not 
known, and the exact composition of the sub-millijansky 
radio pop ulation is still unknown and a matter of debate 
(ICondonl H98l IWindhorst et all Il985l iGruppioni et all 
19991: ISevmour et all 12004 Uarvis fc Rawlings! 1200 



rate indica tor was well calibrated using a local sample 
(lBellll2003|) and compar ed to other star formation trac- 
ers ([Hopkins et al.ll2003h . 

However, it still remains to uncover the global proper- 
ties of the intermediate-redshift (z < 1.3) radio sources. 
For example, the cosmic star formation history of the 
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In this work and in a number of accompanying papers, 
we will focus on these properties of radio sources us- 
ing th e 1.4 GHz VLA-COSMOS survey (jSchinnerer et all 
2007). The main aim of the current paper is twofold. 
First, we develop a method based only on multi- 
wavelength photometric data to efficiently separate SF 
from AGN galaxies in the VLA-COSMOS 20 cm sur- 
vey. Secondly, we use this classification to derive the 
composition of the sub-mjy radio population. In Sec. [2] 
we describe the COSMOS multi-wavelength data, and 
in Sec. [3] we present the cross-correlation of the sources 
detected at 1.4 GHz with catalogs at other wavelengths. 
In Sec. H] we describe our source classification method- 
ology and introduce our 'rest-frame color based classifi- 
cation method' (see below), which we calibrate and ex- 
tensively test using a large well-characterized sample of 
local galaxies. We present the classification of the VLA- 
COSMOS 1.4 GHz radio sources with identified optical 
counterparts in Sec. [5l and in Sec. [6] we compare our 
classification method with other classification schemes 
proposed in the literature. In Sec. [7] we study the 'popu- 
lation mix' in the VLA-COSMOS radio survey, based on 
the entire sample of VLA-COSMOS radio sources. We 
summarize our results in Sec. [8] 

Throughout the paper we report magnitudes in the 
AB system, and assume the following cosmology: Ho = 
70, fJjvf = 0.3, ft\ = 0.7. We define the radio synchrotron 
spectrum as F v oc v~ a , and assume a = 0.8 if not stated 
otherwise. Hereafter, we refer to our method to classify 
the VLA-COSMOS radio sources into five sub-types of 
objects (star candidate, QSO, AGN, SF, high-z galaxy) 
as "classification method", and to our method to disen- 
tangle only the SF from AGN galaxies, based on rest- 
frame color properties, as "rest-frame color based selec- 
tion method". 

2. THE MULTI- WAVELENGTH DATA SET 

In this section we describe the COSMOS multi- 
wavelength data used for the work presented here. 

2.1. Radio data 

The COSMOS field was observed at 1.4 GHz (20 cm) 
with the NRAO Very Large Array (VLA) in A- and C- 
co nfiguration (VLA-COSM OS Large Project; for details 
see lSchinnerer et al.ll2007l ). The final map, encompassing 
2D°, has a resolution of 1.5" x 1.4", and a mean rms of 
~ 10.5 [15] /iJy/beam in the central 1 [2]D°. 

The VLA-COSMOS source catalog reports the peak 
and total (i.e integrated) flux density for each object. 
For extended sources the total flux den sity is derived by 
integ rating over the object's size (see ISchinnerer et al.l 
120071 for details), while for unresolv ed sources it is set 
to be equal to the peak flux density. iBondi et al.l (|2007f ) 
have shown that bandwidth smearing effects (i.e. chro- 
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matic aberration), combined with the pointing layout 
of the VLA-COSMOS observations, systematically de- 
crease the measured source's peak flux density to ~ 80% 
of its true value, while the total flux density remains 
unaffected. Therefore, to correct for this, all peak flux 
densities in the catalog need to be increased by 25%. 
However, such an effect further entails a necessary re- 
definition of the source s in the field considered to be un- 
re solved (cf. Fig. 14 in lSchinnerer et al.ll2007l and Fig. 2 
in lBondi et al.ll2~007f h Therefore, to properly correct for 
bandwidth smearing eff ects, we have re-sele cted the unre- 
solved sources following lBondi et alJ ()2007l ). and set their 
total flux densities to be 1.25 times their peak (respec- 
tive integrated) flux densities. Throughout the paper we 
will use the integrated flux density, corrected for band- 
width smearing where needed, as the representative flux 
density for each source. 

In order to minimize the number of possible spurious 
radio sources (~ 50% below 5a), we select only objects 
from the catalog that were detected at a signal to noise 
of 5, and are located outside regions contaminated by 
side- lobes from nearby bright sources. This yields 2388 
(out of 3643; i.e. ~ 65%) sources, 78 of which consist of 
multiple components. 

2.2. Near-ultraviolet, optical and infrared imaging data 

The NUV to MIR imaging data and photometry for 
the COSMOS survey used here include data taken dur- 
ing 2003-2006 with ground - (Subaru, KPNO, CTIO, 
CFHT) and space (HST, Spitzer) - based telescopes, cov- 
ering a wavelength range from 3500 A to 8 /zm, described 
them in more detail below. 

2.2.1. Ground-based data 

The data reduction of the COSMOS ground-based ob- 
servations in 15 photometric bands ranging from NUV 
to NIR, and t he generation of the phot ometric catalog, 
is prese nted in ICapak et al.l (|2007t ) and iTaniguchi et al.l 
(|2007al) . Here we make extensive use of the COSMOS 
photometric catalog. The photometric catalog was se- 
lected using the 0.6" resolution i + image. However, the 
photometry was obtained from the PSF (point-spread 
function) matched images, which degrades the resolution 
to ~ 1.8". The median 5a depths in AB magnitudes in 
the catalog for the u* , Bj, g + , Vj, r + , i + , £*, z + and K s 
bands 17 are 26.4, 27.3, 27.0, 26.6, 26. 8, 26.2, 24, 25.2 an d 
21.6, respectively (see also tab. 2 in ICapak et al.|[2007h . 
It is noteworthy that the detection completeness of the 
catalog is above 87% for objects brighter than i = 26. 

2.2.2. Space-based data 

The HST/ACS observations, which cov ered 1.8D° of 
the 2D ° CO SMOS field, are describ ed in iScoville et all 
(|2007aft and lKoekemoer et al.l (|2007l) . The F814W band 
imaging has a resolution of 0.09" a nd a 5a point-sourc e 
sensitivity of Iab — 28.6 (see also ICapak et al.ll2007[ ). 
The ACS so urce catalog, which we u tilize here, was con- 
structed by iLeauthaud et al.l ((2007) , with special care 
given to the separation of point-sources from extended 
objects. 

17 The '+' super-script and 'J' sub-script designate the Subaru 
filters, while the '*' sign stands for CFHT filters. 



The Spitzer observations of the COSMOS field in all 
seven ban ds (3.6, 4.5, 5. 8 8.0 24, 70, 160 fim) are de- 
scribed in lSanders et all (|2007l ). The 3.6 - 8 /xm band 
catalog is available to full depth for the entire field. The 
resolution in the 3.6, 4.5, 5.8, 8.0 fim bands is 1.7", 1.7", 
1.9", and 2", respectively. The catalog was generated us- 
ing SExtractor on the four IRAC channels in dual mode, 
with the 3.6 /an image as the detection image. The 5a 
depth for point-sources at 3.6 fim is 1 /iJy, corresponding 
to an AB magnitude of 23.9. In this work we also make 
use of the MIPS 24 (im catalog obtained from the shallow 
observations of the entire COSM OS field during Cyc le 2 
of the S-COSMOS program (see ISanders et aiTl2007l for 
details). The resolution and 5a depth of the catalog are 
6" and 0.3 mJy, respectively. The latter corresponds to 
an AB magnitude of 17.7. For the purpose of this paper, 
we will use only sources that were detected at 24 /zm at 
or above the 3a level corresponding to their local rms. 

2.3. X-ray data 

The full 2D° COSMOS field was observed with the 
XMM-Newton satellite EPIC camera for a total net 
integration time of 1.4 Ms (for a description of the 
XMM-COSMOS survey see iHasingerlTaTl [200l . The 
limiting flux density of the XMM-COSMOS survey is 
10~ 15 erg cm~ 2 s _1 and 5 x 10~ 15 erg cm~ 2 s _1 in 
the soft (0.5 - 2 keV) and hard (2 - 10 keV) bands, 
respective ly. The X-ray point-s ource detection is de- 
scribed in lCappelluti et all (|2007l ). and the optical iden- 
tifications of the X-ray sources for the first 12 observed 
XMM fields (over a total of 1.3d ) are presented by 
iBrusa et al.l (|2007aD . For the analysis presented here we 
utilize the catalog with 1865 optical counterparts of the 
XMM-COSMOS point sources, drawn from the full 2D° 
XMM-Newton mosaic (jBrusa et al.ll2007bf ). 

2.4. Photometric redshifts 

Th e COSMOS photometric redshifts (jllbert et all 
I2007T ) utilized here are based on an a large amount of deep 
multi- color data (|Capak et al.l l2007bl : rTaniguch i et al.l 
l2007alrbl ): 6 broad optical bands obtained at the Sub- 
aru telescope (u + , g + , r + , i + , z + ) and 2 at CFHT (u* 
and i*), 8 intermediate and narrow band filters from 
the Subaru telescope (7,4427, 7.4464, 7,4505, 7.4574, 
7A709, IA827, NB711, NB816), deep TCs-band data 
from the WIRC AM / CFHT camera (McCracken et al, 
in prep), and 3. 6/xm and 4.5/iw data from the SPITZER 
IRAC camera ((Sanders et al.l I2007D . The photometric 
redshift s are estimated via a standard \ 2 fitting pro- 
cedure (|Arnouts et al.ll2002f) using the code Le Phare 18 
written by S. Arnouts & O. Ilbert. A major feature of 
this method is the calibration of the photometric red- 
shifts using a spectroscopic sample of ~ 1000 bright 
galaxies (Iab < 22 . 5) obt ained as part of the zCOSMOS 
survey (|Lillv et al.ll2007D . We follo w exactly th e same 
calibration method as described in lllbert et al.l (|2006h : 
a) a calibration of the photometric zero-points, and b) 
an optimization of the template SEDs (spectral energy 
distributions). This calibration method allows us to re- 
move systematic offsets in the estimates of the photomet- 
ric redshifts. A direct comparison between the photomet- 
ric redshifts and the zCOSMOS spectroscopic redshifts 

18 www.lam.oamp.fr/arnouts/LE_PHARE.html 
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Fig. 1. — The comparison of the photometric and spectroscopic 
redshifts for the VLA-COSMOS radio sources with optical coun- 
terparts (see Sec. 13.11 1 for which spectroscopy is available. The 
distribution shown was limited to spectroscopic redshifts < 1.3, 
consistent with the redshift range used in this work. The number 
of sources (#), mean (fi), and standard deviation (a) of the distri- 
bution are indicated in the top left of the panel. Note the excellent 
accuracy of the photometric redshifts. 

shows that the photometric redshifts reach an accuracy 
of a ~ 0.014 at i < 22.5. The fraction of catas- 

trophic failures is less than 1% at i < 22.5. Such an accu- 
racy and robustness can be achieved thanks to both the 
intermediate bands and deep NIR photometric data. The 
photometric redshifts for the ent ire COSMOS popu lation 
will be described in full detail in lllbert et alj ((2007). The 
galaxies in the sample used here are radio selected, i.e. 
they are not randomly drawn from the global COSMOS 
population. Therefore, in Fig. [T] we show the compar- 
ison of the photometric and spectroscopic redshifts for 
a sub-sample of our VLA-COSMOS sources with avail- 
able spectroscopy (see next section). The accuracy is 

a = 0.027, which is somewhat lower than the ac- 

curacy for the full sample of COSMOS sources, however 
it is still satisfactory. 

As photometric redshift codes generally take into ac- 
count only galaxy SED models, the photometric redshifts 
for broad line AGN are usually poorly estimated (not 

better than a (^j) ~ 0.1 with a large fraction of catas- 
trophic outliers and no solutions found beyond z ~ 1.1), 
and alternative ways for their redshift computations have 
to be applied. At the time of writing, no accurately es- 
timated photometric redshifts for broad line AGN exist 
for the COSMOS project. The photometric redshifts for 
broad line AGN will be presented in a future publication 
((Salvato et al.ll2007t ). 

2.5. Optical spectroscopic data 

The ongoing CO S MPS optical spectroscopic surveys 
(|Trump et alj|2007t llmpev et all 120071 : iLillv et alj|2007l) 
provide to date 657 spectra, with good redshift esti- 
mates, for objects in the VLA-COSMOS 1.4 GHz radio 
sample described in Sec. 12.11 We augment this spectro- 
scopic data set with available spectroscopic information 
for 65 galaxies from the SDSS DR4 "main" spectroscopic 
sample, 13 objects fro m the SDSS DR5 quasar catalog 
((Schneider et al.ll2005h . 2 sources from the 2dF survey, 
as well as for 27 objects taken with the MMT 6.5 m tele- 



scope, and presented bv lPrescott et alj ((20061 ). Thus, a 
total of 764 spectra is available. However, as a num- 
ber of sources were spectroscopically observed multiple 
times, we have spectroscopic information for 520 unique 
sources in our radio sample. Throughout the paper, we 
use the spectroscopic redshifts, where available. We also 
use this sub-set of radio sources with observed optical 
spectra as a control sample to verify the presented clas- 
sification method. 

3. VLA-COSMOS 1.4 GHZ RADIO SOURCES AT OTHER 
WAVELENGTHS 

In this section we define the 'matched' radio source 
sample, a sample of radio sources with optical coun- 
terparts cross-correlated with the panchromatic COS- 
MOS observations, as well as the 'remaining' radio source 
sample, both of which will be used throughout the pa- 
per. First, we restrict the full VLA-COSMOS radio 
source sample to objects which have optical counterparts 
(Sec. I3.ip . Then we positionally match these objects 
with sources detected in the MIR (Sec. 13. 2|) and X-ray 
(Sec. 13. 3|) spectral ranges. In Sec. 13. 41 we describe the re- 
maining radio sources that are either without identified 
or with identified but flagged optical counterparts. 

3.1. Positional matching of the COSMOS radio and 
NUV/optical/NIR catalogs 

The VLA-COSMOS Large Project source catalog con- 
tains 2388 radio sources detected at a signal to noise 
5 and outside side-lobe-contaminated regions (see 
Sec. l2.lj) . 78 of these consist of multiple components. For 
the purpose of this paper we match these radio sources 
with sources that have also been detected in the optical 
regime, and are reported i n the COSMOS photometric 
catalog (Cap ak~et alj 12007). In order to obtain a sam- 
ple with reliable radio-optical counterparts, we position- 
ally match the radio sources only with optical sources 
brighter than i — 26. The reason for this is illustrated 
in Fig. [21 where we show the distance between the radio 
sources and their nearest optical counterparts as a func- 
tion of the i band magnitude. As the median distance 
rapidly increases for i > 26 (most probably introducing 
a significant number of false match associations) we ap- 
ply a cut of i = 26 to the NUV-NIR photometric catalog 
before matching the radio and optical catalogs. 

Hence, to find the corresponding optical counter- 
part for each radio source (excluding multi-component 
sources, which are separately addressed below), we 
search for the nearest optical neighbor within a radial 
distance of 0.5". The search radius was chosen in such a 
way that it balances a high completeness of true matches 
and a low false-match contamination rate: A cut-off of 
0.5" essentially includes all true matches in the sample, 
with a false association rate (computed from the source 
density in the matched catalogs) of only ~ 4%. The 
high completeness and low contamination are due to the 
excellent astrometric accuracy of both the COSMOS ra- 
dio and optical data. Our matching yielded 1749 ra- 
dio sources with securely identified optical counterparts. 
However, 252 (~ 15%) of these are located in masked- 
out regions (i.e. around bright saturated stars) in the 
photometric catalog. Thus, their NUV-NIR photometry, 
as well as the photometric redshift computation has a 
significantly reduced accuracy. We exclude these objects 
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Fig. 2. — Distance as a function of i band magnitude for radio 
sources in the VLA-COSMOS survey (with S/N > 5) positionally 
matched to the closest optical counterpart (small dots). The large 
dots show the median distance for each magnitude bin (the width 
of the bin is indicated by horizontal lines) and the corresponding 
interquartile range (vertical lines). Note that the median distance 
is significantly larger beyond i = 26, presumably introducing a 
significant number of false match associations. The shaded area 
indicates the allowed matching region, which is set by the matching 
radius (0.5") and the magnitude cut-off (i = 26) used to find secure 
optical counterparts for the radio sources in the sample (see text 
for details). 

from our main sample. 

The multi-component radio sources in the VLA- 
COSMOS survey consist of radio sources which could 
not be fitted using a single Gaussian function (see 
iSchinnerer et al.l l2007f ) . The radio morphologies of such 
sources can be fairly complex (e.g. single or double lobed 
radio galaxies), and this makes it substantially more dif- 
ficult to associate such radio source s with the appro- 
priate opt i cal co unterparts (see e.g. Ilvezic etldl 120021 : 
iBest et aT]|2005aD . In order to avoid any biases which 
may be caused using an automatic association procedure, 
the optical counterparts of the VLA-COSMOS multi- 
component radio sources were determined visually. The 
1.4 GHz catalog contains 78 multi-component sources de- 
tected at or above 5a, and 65 were securely associated 
with an optical counterpart with i < 26, however 4 are 
located inside masked-out areas (around bright saturated 
stars) in the photometric catalog, and we therefore ex- 
clude them from the main sample. 

In summary, the applied matching criteria yield 1814 
(~ 76% of the 2388 radio sources with S/N > 5) radio 
sources with secure optical counterparts down to i = 26, 
65 of which are multiple component sources. The most 
accurate NUV-NIR photometry (i.e. excluding flagged 
regions around saturated objects) was obtained for 1558 
(~ 86% of 1814), 61 of which are multiple-component 
radio sources. Hereafter, we refer to this latter sample 
of 1558 radio sources, which make up ~ 65% of the ra- 
dio sources with S/N > 5, and that were matched to the 
NUV-NIR catalog, as the 'matched' radio source sample. 
For reference, the 1.4 GHz total flux density distribu- 
tions for the complete radio sample, the matched radio 
sample, and the sub-sample with available spectroscopy 
is shown in the top panel in Fig. [3] The distribution 
of the i band magnitude for the matched radio sam- 
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Fig. 3. — Top panel: The distribution of the total 1.4 GHz 
(20 cm) flux density for i) the complete radio source sample 
(S/N > 5; dark-grey shaded histogram), ii) the matched radio sam- 
ple (grey-shaded histogram) and iii) the spectroscopic sub-sample 
(light-grey shaded histogram). The inset shows the cumulative dis- 
tribution for the three samples computed as a function of decreasing 
total flux. Note that the spectroscopic sub-sample fairly represents 
the faint radio population as a function of the total flux. Bottom 
panel: Distribution of the i band magnitude (Subaru where avail- 
able, otherwise CFHT) for sources in the VLA-COSMOS matched 
radio source sample, and the spectroscopic sub-sample in the same 
notation as in the top panel. 

pie, as well as the spectroscopic sub-sample, is shown 
in the bottom panel in Fig. [3] It is also worth noting 
that our cross-correlation is consistent with the results of 
the maximum likeli hood ratio techniqu e applied to VLA- 
COSMOS sources ijCiliegi et alJl2007aT >. however our re- 
strictions for the masked-out regions in the photometric 
catalog, as well as the optical magnitude limit, are more 
conservative, as the analysis presented here strongly re- 
lies on accurate NUV to NIR photometry. 

A further data set that we use in the analysis pre- 
sented here is the HST/ACS point-source information. 
We extract this information for each radio source in 
our matched sample by positionally matching the op- 
tical counterparts of the radio sources with point- 
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sources identified in the HST/ACS F814W source catalog 
(jLeauthaud et al.ll2007| ). Using a matching radius of 0.5" 
yields 47 objects in our matched radio sample classified 
as point sources based on the HST/ACS F814W images. 
The mean distance between the matched objects is only 
(0.12 ±0.07)". 

3.2. Radio - optical sources with IRAC and MIPS 

detections 

We cross-correlate the matched radio source sample 
with the S-COSMOS - IRAC catalog using a maximum 
allowed distance to the optical counterparts of our ra- 
dio sources of 0.5". [Note that such a cross-correlation 
allows for a maximum distance between the radio and 
MIR sources to be 1".] Such an adopted search radius 
essentially selects a complete radio - optical - MIR sam- 
ple with a false match association for the MIR sources 
of < 1% with the optical counterparts, and < 4% with 
the radio counterparts. In summary, out of 1558 radio 
sources in the matched radio sample, 1448 (93%) have 
secure MIR counterparts. 

The 24 (im flux densities for all our radio sources were 
obtained from the COSMOS field observations using the 
S-COSMOS - MIPS shallow survey with a resolution of 
6". Although a relaxed search radius of 5" was used to 
find the radio - 24 /zm counterparts, the median distance 
is only 0.19" with an interquartile range of 0.16". About 
50% (799 out of 1558) sources in the matched radio sam- 
ple have a MIPS counterpart at 24 /j,m with a signal to 
noise at or above 3. 

3.3. Radio - optical sources with point-like X-ray 

emission 

Using the max imum likelihood ratio technique 
iBrusa et al.l (|2007af ) present ed the optical identifica tions 
of the X-ray point-sources (jCappelluti e t al. 200 7j) de- 
tected in the XMM-COSMOS survey (jHasinger et all 
l200l . Here we utilize their identifications to match the 
sources in our matched radio sample with detected X- 
ray point sources. Out of 1558 radio sources with op- 
tical counterparts, 179 (12% of 1558) are identified as 
point-sources in the X-ray bands. 17 of the se have mul- 
tiple co unterpart candidates as defined by IBrusa et alJ 
(|2007al ). In these cases, if we assume that the radio 
sources are physically associated with the X-ray sources, 
then the radio data, which have a significantly better 
astrometric accuracy, can be used to constrain more pre- 
cisely the optical counterpart of this given object. A vi- 
sual inspection of the 17 s ources, classi fi ed as h aving am- 
biguous identifications by IBrusa et al.l (|2007bD . strongly 
suggests that their most probable optical counterparts, 
reported in the X-ray - optical catalog, are real associa- 
tions. Hence, we proceed in our analysis taking all 179 
X-ray detected point sources to be true counterparts of 
the objects in the matched radio sample. 

3.4. Radio sources with photometrically flagged or 
without optical counterparts at other wavelengths 

In Sec. 13.11 we have defined the matched radio sample 
which consists of 1558 1.4 GHz sources that have opti- 
cal counterparts out to an i band magnitude of 26, and 
within a radial distance of less than 0.5". These sources 
were also required to have the most accurate NUV-NIR 



photometry, i.e. counterparts within flagged regions due 
to saturation and blending effects in the NUV-NIR im- 
ages were excluded. Thus, 830 radio sources remain with 
no identified optical counterparts within these limits, 256 
(i.e. ~ 30%) of which have counterparts with i < 26 that 
lie in masked-out regions. Hereafter, we will refer to this 
sample of sources as the 'remaining radio source sam- 
ple'. We positionally match these sources to the IRAC 
catalog using a maximum allowed distance of 1", and 
find 610 (~ 75%) matches. Based on Poisson statistics 
and the source density of the MIR sources, such a search 
radius essentially includes all true matches with a false 
contamination rate of < 4%. It is worth noting that 
more than one half of the remaining 25% of the radio 
objects were independently identified as possible spuri- 
ous sources, based on visual inspection, while the other 
half are either located in blended regions in the IRAC 
images or slightly further away than the allowed 1" from 
the position reported in the IRAC catalog (the morphol- 
ogy of the IRAC sources being often extended). Thus, 
we consider these ~ 75% of radio - MIR matches repre- 
sentative of the entire remaining radio population. Out 
of the 830 sources 318 (i.e. ~ 40%) have MIPS 24 /im 
detections (S/N > 3), and 31 (~ 4%) have XMM point 
source counterparts (these 31 objects are a sub-sample 
of the 256 objects in the flagged regions). In Sec. [7] we 
analyze the properties of these remaining sources, and 
their contribution to the 'population mix' in the VLA- 
COSMOS survey. The summary of the multi-wavelength 
cross-correlation of the VLA-COSMOS radio sources is 
given in Tab. [1] 

4. CLASSIFICATION METHODOLOGY 

Extragalactic radio sources consist of two main pop- 
ulations: star forming and AGN galaxies. We further 
divide the AGN class into two sub-classes: QSOs (of- 
ten unresolved in optical images, with broad emission 
lines in their spectra and high optical luminosity) and ob- 
jects where the AGN does not dominate the entire SED, 
such as Type 2 QSOs, low-luminosity AGN (Seyfert and 
LINER galaxies) and absorption-line AGN (resolved in 
optical images, with both broad, narrow or no emission 
lines in their optical spectra). Throughout the paper, we 
will mostly refer to the latter sub-class only as 'AGN'. 

The commonly adopted and well-calibrated tool for 
disentangling SF galaxies from low-luminosity AGN 
(Seyfert and LINERs) is the optical spectroscopic 
diagn ostic diagram (|Baldwin. Phillips &: Terlevichl 
1981). that is based on two emission line flux ra- 
tios ([OIII 5007] /H/3 vs [Nil 65841 /Ha; he r eafter 
BPT diagram; see also Q /cillcux & Ostcrbrock 119871 : 
iRola. Terlevich fc Terlevichl Il997t iKewlev et all 120011 ). 
This diagnostic tool has been extensively used in the 
past for a successful s e parat io n of local SF and . AGN 
galaxies (ISadler et al.l |1999t iKauflmann et al.l 120031: 
Brinc hmann et aLll2004l : lObric et al.ll2006l : ISmolcic et all 
20061) . However, as spectroscopic observations are very 
expensive in terms of telescope time, especially when 
large numbers of faint objects need to be observed, 
alternative methods for the separation of SF from AGN 
galaxies, that eliminate the need for spectroscopy, have 
to be invoked. Here we develop such a method (hereafter 
'rest-frame color based classification method'), which 
we apply in the next sections to identify SF and AGN 
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galaxies in the VLA-COSMOS matched radio sample. 
The main idea of our method is drawn from the findings 
that the overall NUV to NIR SED of galaxies is a 
one-parameter family, and that spectral diagnostic 
parameters, such as line strengths, appear to be well 
corre l ated with the overal l galaxy's SED (s eelObric et al. 
120061; ISmolcic et al.|[20M) . In particular. ISmolcic e~ 
( 20061 ) have found a tight correlation between rest-frame 
colors of emission-line galaxies and their position in the 
BPT diagram. This correlation thus provides a powerful 
tool for disentangling SF from AGN galaxies using only 
photometric data, i.e. rest-frame colors, and we utilize 
it as the key of our rest-frame color based classification 
method. 

4.1. Calibration of the rest-frame color based 
classification method in the local universe 
4.1.1. The local spectroscopic sample 

In order to obtain insight into the efficiency of the clas- 
sification method, based on the rest-frame color (PI; 
see below), we construct a large sample of well-known 
low-redshift galaxies (0.01 < z < 0.3), whose prop- 
erties are assumed to present well the properties of 
the galaxies in the VLA-COSMOS matched radio sam- 
ple. The local sample was generated from the SDSS 
main galaxy sample, positionally matched to sources 
detected in the 1.4 GHz NVSS survey. Additionally, 
a sub-sample of galaxies detecte d with IRAS was con- 
structed (see lObric et al.l [2006 for details about the 
cross-correlation of the SDSS, NVSS, IRAS catalogs). 
The SDSS/NVSS sample contains 6966 galaxies and the 
SDSS/NVSS/IRAS sample 875 galaxies with available 
SDSS optical spectroscopy. The computation of rest- 
frame colors ( P1,P2 ) for these galaxies is presented in 
ISmolcic et al.l (|2006| ). The rest-frame colors PI and 
P2 optimally quantify the distribution of galaxies in 
the rest-frame color-color space. They are derived from 
the modified Stromgren photometric system (uz,vz,bz,yz 
encompassing th e wavele ngth range of 3200 - 5800A; 
lOdell et al.ll2002t see also ISmolcic et al.l I2006D via a 2- 
dimensional principle component analysis in color-color 
space. A more detailed description of the colors, includ- 
ing their equational form, is given in Appendix |A"1 

It is noteworthy to mention that given a) the detec- 
tion limits, and b) the areal coverage of the NVSS and 
VLA-COSMOS surveys, both surveys observe approxi- 
mately the same populations of objec ts, although over 
differ ent redshift intervals (see Fig. 1 in lSchinnerer et all 
2007). Assuming that evolutionary effects with redshift 
do not significantly alter the reliability of the identifica- 
tion method presented below, at least out to z = 1.3, this 
makes the local sample of galaxies representative of the 
galaxies in the VLA-COSMOS matched radio sample. 

Based on spectral line properties we separate the local 
sample into three classes of objects: AGN, star-forming 
galaxies and composite objects, where the latter are con- 
sidered to have a comparable contribution of both star 
formation and AGN activity. First, galaxies with emis- 
sion lines in their spectra are separated into these three 
classes using their position in the BPT diagram (see bot- 
tom panel in Fig. [4j). 19 Second, as the galaxies that have 

19 Note that the classification of AGN, SF, and composite galax- 
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Fig. 4. — Top panel: The distribution of ~ 3, 400 SDSS emission 
line galaxies, in the redshift range of 0.01 to 0.3, drawn from the 
DR1 "main" spectroscopic sample, which were also detected by the 
1.4 GHz NVSS survey, in a diagram constructed with the principal 
rest-frame colors (Pl,P2). Each dot corresponds to one galaxy, 
and the color code is determined by the position in this plane. 
The bottom pan el shows the distribution of these ga laxies in the 
(BPT) diagram l IBaldwin. Phillips fc Terlevichl H981). The lower 
dashed line separa tes the regions populat ed by star-forming and 
composite galaxies (Kauffmann et al. 2003). The upper dashed line 
separates the region s populated by composite and AGN galaxies 
(Kcwlcv et al. 2001). The dots are colored according to their PI 
and P2 values given in the top panel. Note the strong correlation 
between the rest-frame colors and the emission line flux ratios (see 
text for details). 

no emission lines in their spectra cannot be star forming 
(see also Sec. 14.1.31 for a discussion of this point), and as 
all of the objects in the SDSS/NVSS sample are observed 
to have 1.4 GHz emission which arises either from AGN 
or star formation activity in a galaxy, we define galaxies 
without emission lines in their spectra as AGN (see also 
iBest et aT1l2005al who classified these types of objects as 
absorption line AGN). 

4.1.2. Completeness and contamination due to the 
photometric selection 

In the top panel in Fig. [4] we show the distribu- 
tion of ~ 3,400 SDSS/NVSS emission line galaxies in 

ie s based on the BPT d iagram changed compared to the one used 
in ISmolcic et"aLl | |2006|) . 
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the (P1,P2) rest- frame color-color diagram. The color 
code is determined by the position of a galaxy in this 
plane. The bottom panel shows the BPT diagram for 
the same galaxies with the colors adopted from the up- 
per panel. For these radio luminous galaxies, a strong 
correlation exists between their rest-frame optical col- 
ors and emission line properties, in particular between 
PI and log ([Nil 6584] /Ha). We want to stress that the 
SDSS/NVSS galaxies with no emission lines in their spec- 
tra have typically red PI colors, with a median PI value 
of 0.46 an d an interquartile ra nge of 0.13 (see also e.g. 
Fig. 12 in ISmolcic et alJl2006ft . This implies that the 
rest-frame color PI can be used as an efficient separator 
between AGN and SF galaxies in samples where these 
t wo types of gal a xies a re the two dominant populations. 

ISmolcic et al.l (|2006f) argued that the P2 color is a 
proxy for the galaxies' dust content, with higher values 
of P2 corresponding to higher dust attenuation. How- 
ever, as the dynamic range of P2 is very narrow, we will 
rather base our selection on the PI color alone. Here- 
after, based on a compromise between completeness and 
contamination of the photometrically selected samples 
of galaxies, we choose a color-cut of Pl=0.15 as the di- 
viding photometric value between star forming galaxies 
(Pl<0.15) and AGN (Pl>0.15). Given this boundary 
we infer that the sample of photometrically selected star 
forming galaxies is 85% complete, and contaminated by 
AGN and composite objects at the 20% and 10% lev- 
els, respectively. Vice versa, the photometrically selected 
sample of AGN is 90% complete and contaminated by SF 
and composite galaxies at the 5% and 15% level, respec- 
tively (see Appendix IB . 1 1 for computational details). We 
will use these estimates in the following analysis to statis- 
tically correct the photometrically selected SF and AGN 
samples in the VLA-COSMOS survey. 

4.1.3. Is the rest-frame color based selection method biased 
against dusty starburst galaxies? 

One of the main advantages of radio observations is 
that the intrinsic physical properties that drive the ra- 
dio emission can be derived without any need for dust- 
extinction corrections (as radio emission passes freely 
through dust). In particular, radio observ ations provide 
a dust-unbiased view of star- formation (sec ICondonf l992 
for a review; see also lHaarsma et all 120001 ) . Hence, it is 
important to address whether our rest-frame color selec- 
tion technique misses out dusty starburst galaxies. In 
order to do this we study a sub-sample of SDSS/NVSS 
galaxies that were also detected with the IRAS satellite 
at IR wavelengths. 

First we address the composition of the 'missed' ~ 15% 
of the SF galaxies (note that the photometrically se- 
lected SF galaxy sample is 85% complete) in order to 
show that our selection does not introduce biases against 
the most luminous starburst galaxies. About 30% of 
the SF galaxies with Pl>0.15 have IRAS detections, 
which is consistent with the fraction of IRAS detections 
in the SF galaxy sample with Pl<0.15. This suggests 
that the composition of the missed SF galaxies is not 
significantly different from the composition of the se- 
lected SF galaxies. Second, the fractions of luminous 
and ultra-luminous IR galaxies (LIRGs and ULIRGs) in 
the spectroscopically classified SF galaxy samples with 
Pl<0.15 and Pl>0.15 appear to be consistent with 



each other. And third, the completeness and contam- 
ination of the SDSS/NVSS/IRAS galaxy sub-samples 
selected using the rest-frame color based classification 
method is fairly consistent with the properties of the en- 
tire SDSS/NVSS sample (see Appendix E21 for details). 
All of this implies that the selection criteria for SF and 
AGN galaxies adopted on the basis of the analysis of the 
full SDSS/NVSS sample works almost equally efficiently 
for the IRAS detected sub-sample. Even further, only 
- 10% and - 5% of star forming LIRGs and ULIRGs 
galaxies, respectively, are omitted by the method (see 
Appendix IB . 21 for details). 

It is noteworthy that in the entire SDSS/NVSS/IRAS 
sample only 48 objects (i.e. ~ 5%) were identified as ab- 
sorption line AGN, i.e. having no emission lines in their 
optical spectra. It is possible that very high dust obscu- 
ration may suppress the detection of emission lines in the 
optical spectrum. A visual search for signatures of HII 
regions in the 48 SDSS color-composite images suggested 
that at the most 30% of these galaxies may possibly be 
undergoing star formation (e.g. possible galaxy merger, 
or extended morphology). Therefore, only a negligi- 
ble fraction of less than 1.5% of the SDSS/NVSS/IRAS 
galaxies may be so heavily dust-obscured that no emis- 
sion lines would be detected in their optical spectra. 

In order to test this issue further, we have synthesized 
the PI color for the 'standard' dusty starburst galaxies 
M 82 (a typical LIRG), and Arp 220 (the prototypical 
ULIRG) using spectral templates given by Polletta et 
al. (2006). The optical to NIR part of these templates 
was gener ated using the ste llar population synthesis code 
GRASIL (|Silva et al.lH"998h . The derived PI colors for 
M 82 and Arp 220 are 0.078 and 0.149, respectively, im- 
plying that M 82 would not have been missed by the rest- 
frame color based classification method, while Arp 220, 
although close to the adopted limit in PI, still lies within 
our selection criterion (note that in reality observational 
photometric errors of M 82- and Arp 220- like objects will 
introduce a scatter in PI; see also Sec. 14.2.2]) . Based on 
the tests presented above we conclude that our rest-frame 
color based classification method is not significantly bi- 
ased against dusty starburst galaxies. 

4.2. Application of the rest-frame color based 

classification method to the multi-wavelength 
photometry of the VLA-COSMOS radio - optical 
galaxies 

In the previous section we have studied, and calibrated, 
the rest-frame color based classification method using the 
local galaxy sample. Assuming that evolutionary effects 
with redshift do not significantly affect the reliability of 
the classification, we can safely apply it to the galaxies 
in the VLA-COSMOS 1.4 GHz matched radio sample 
(Sec. I5.4|) to separate SF from AGN galaxies at inter- 
mediate redshifts. However, first we need to derive the 
rest-frame PI color from the observed SED of the VLA- 
COSMOS galaxies. We do thi s via high-resolution SED 
fitting, described in Sec. 14.2.11 and we test the accuracy 
of the rest-frame color synthesis in Sec. 14.2.21 

4.2.1. Derivation of rest-frame colors 

In order to estimate the rest-frame color PI for each 
galaxy in the matched radio sample that we do not clas- 
sify as a star or QSO (see Sec. [5]), we use the GOS- 
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SIP (Galaxy Observed Si mulated SEP In teractive Pro- 
gram) software package (jFranzettil 120051 ) , designed for 
fitting a galaxy's SED to a set of chosen spectral mod- 
els. The SED of the galaxies in our sample, that we use 
for fitting, extends from 3500 A to 2.5 /im (comprised in 
6 photometric pass-bands) and we fit to each observed 
SEP a realization o f ~ 1 00, 000 spectra built using the 
iBruzual fc Charlotl (|2003h stellar synthesis evolutionary 
models. Star formation histories have been parameter- 
ized by an underlying continuous star formation history 
(decaying exp onentially), and r ando mly superimposed 
bursts (see alsolKauffmann et al.|[200l iKong et al.ll2004j : 
Sali m et al.ll200l . We cover ages between 100 Myr and 
13.5 Gyr, specific star formation rates (star formation 
rate per unit galaxy stellar mass) between 10~ 15 yr~ x 
and 3.93 x 10 -8 yr _1 and metallicities from a tenth to 
twice solar. 

For each object in our sample the model spectra in our 
library are redshifted to the galaxy's measured redshift 
(spectroscopic where available, otherwise photometric), 
then each spectrum is convolved with the observed filter 
response function 20 , and then fitted to the available ob- 
served photometric data, using a direct x 2 minimization 
procedure. Output parameters, such as e.g. rest-frame 
colors, stellar mass or the 4000 A break, are taken from 
the best fit model spectrum. In order to derive physi- 
cally meaningful output parameters, we restrict the fit- 
ting procedure to models that have an age smaller than 
that of the Universe at the galaxy's redshift. 

4.2.2. Accuracy of the derived rest-frame colors 

The synthesized, relative to observed, colors and mag- 
nitudes for the VLA-COSMOS galaxies, computed as de- 
scribed above, are reproduced with a satisfying accuracy 
of ~ 0.07 and ~ 0.04, respectively. To further test the 
accuracy of the derived rest-frame color, we synthesize 
PI for a sample of - 1700 local SPSS/NVSS galaxies. 
For these galaxies the PI colors, computed from their 
spectrum (with an accuracy o f better than 0.03 magni- 
tudes; see ISmolcic et afl r2006). are also available. Thus, 
comparing PI synthesized via SED fitting with the ref- 
erence PI derived from the spectrum gives us a direct 
measure of the achieved accuracy, shown in Fig. O (top 
panel). The root-mcan-squarc-scattcr is ~ 0.1. 

In the bottom panel in Fig. [5] we show the difference of 
the PI color as a function of the synthesized (GOSSIP) 
PI color for the SPSS/NVSS galaxies. A slight system- 
atic trend is present as a function of the derived PI color. 
This trend presumably arises from the presence of strong 
emission lines in the galaxies' SEPs (in the rest-frame 
wavelength range that is used to derive PI), which are 
not taken into account in the BC03 model spectra. In 
our further analysis of the VLA-COSMOS data, we use 
the median offset, shown in the bottom panel in Fig. 
to correct the derived PI color, and we consider the syn- 
thetic PI color to be accurate to ~ 0.1 mag. However, 
an error of 0.1 mag for 68% of the galaxies, and 0.2 for 
95%, may substantially alter the SF/AGN selection, in- 
troducing the largest uncertainty for the galaxies that 
have PI colors close to the chosen PI boundary. To ac- 
count for these uncertainties, we simulate the PI errors 

20 The C OSMOS filter resp onse curves can be found here: 
http: / / www. astro, caltech.edu/"" capak/cosmos / filters 
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Fig. 5. — The comparison of the PI color synthesized from 
the SDSS spectrum (PIsdss that has a typical error of 0.03) and 
via SED fitting (PIgossip) for ~ 1700 SDSS/NVSS galaxies from 
the DR1 "main" spectroscopic sample (top panel). The PI color 
is reproduced with a spread of ~ 0.1. The bottom panel shows 
the difference in the PI color as a function of PIgossip- The 
thin solid lines show the upper PI limits of 0.6, imposed for this 
analysis. Note the slight systematic trend in the derived PI color 
as a function of PIgossip (the large dots are the median offsets, 
with indicated la error bars). The trend presumably arises due to 
the presence of strong emission-lines in the rest-frame wavelength 
range of 3800 — 5800 A, that are not taken into account in the BC03 
models. We use these offsets to correct for these systematics in the 
derivation of PI for the clean radio sample (see text for details). 

using a randomly drawn Gaussian distribution with a 
standard deviation of 0.1 centered at zero (see top panel 
in Fig. |5j). These errors are then added to the galax- 
ies' PI colors derived from the best fit template in the 
SEP fitting, and the SF/AGN selection (see Sec. \EM is 
applied. By repeating this procedure 10,000 times we 
obtain a robust statistical distribution of the number of 
selected SF and AGN galaxies yielding 356 SF and 585 
AGN galaxies (mean) with a root-mean-square scatter of 
only 7. 

Applying our SF/AGN selection using the PI distri- 
bution obtained from the best fit template in the SEP 
fitting yields 340 SF and 601 AGN galaxies (see Sec.EU). 
Thus, ~ 5% less SF galaxies are selected. This is easily 
understood as the blue tail of the PI distribution con- 
tains a smaller number of galaxies than there are in the 
prominent red peak (see e.g. Fig. [24]) . Therefore, a nor- 
mal error distribution will systematically scatter more 
galaxies to the blue PI region, than to the red one. We 
conclude that the photometric errors of the synthesized 
PI color introduce a number uncertainty of ~ 5% in fa- 
vor of SF galaxies. Although ~ 5% is not significant, 
it is necessary to keep this bias in mind in the analysis 
of the 'population mix' of submillijansky radio sources 
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(Sec. EH)- 

5. CLASSIFICATION OF VLA-COSMOS SOURCES IN THE 
MATCHED RADIO SAMPLE 

In this section we present the classification of the en- 
tire VLA-COSMOS matched radio sample into star can- 
didates (Sec. El, QSOs (Sec.EHJ), SF, AGN and high-z 
galaxies (Sec. I5.4j) . We begin with a summary of the 
sample definitions (Sec. EH- 

5.1. Outline and nomenclature 

In Sec. 14. II (see also Appendix [B|) we have presented the 
calibration and effectiveness of the rest-frame color based 
classification method for separating galaxies dominated 
by star formation from those dominated by AGN activ- 
ity. For this we have used the SDSS "main" spectroscopic 
sample - a pure galaxy sample th at by definition doe s 
not contain any star-like objects (|Strauss et al.l I2002T) . 
This, obviously, implies that the same effectiveness of the 
method can only be reached if the rest-frame color based 
classification method is applied to a comparable sample, 
i.e. galaxies only. However, our VLA-COSMOS matched 
radio sample consists not only of galaxies, but also of 
stellar like sources, where the latter are either stars or 
quasi stellar objects (QSOs). Therefore, we classify the 
sources in the matched radio sample into five sub-types - 
a) star candidates, b) quasi stellar objects (QSOs), c) ac- 
tive galactic nuclei (AGN), d) star forming (SF), and e) 
high redshift (z > 1.3; high-z) galaxies. The latter three 
sub-types compose our "VLA-COSMOS galaxy sample". 
The properties of each sub-type are summarized as fol- 
lows. 

Stars: Point-sources in the optical, with their 
SEDs best fit using a stellar template. 

OSQs: Point-sources in the optical (stellar-like 
SEDs are excluded; see above). This criterion es- 
sentially requires that the emission of the nucleus in 
the optical strongly dominates over the emission of 
the host galaxy. Thus, this sample predominantly 
contains broad line AGN (i.e. type-1 AGN), with 
power law spectra in the optical. 

AGN: Galaxies (not point-sources) whose rest- 
frame color properties are consistent with proper- 
ties of AGN (PI > 0.15, X-ray luminosity above 
10 42 erg s -1 if X-ray detected). This selection re- 
quires that the optical emission either shows signs 
of both, the emission from the central AGN as well 
as the emission from the underlying host galaxy, 
or only the latter. Thus, this sample essentially in- 
cludes Seyfert and LINER types of galaxies, as well 
as absorption line AGN, and we limit it to rcdshifts 
of z < 1.3. 

SF galaxies: Galaxies whose rest-frame color 
properties are consistent with properties of star 
forming galaxies (PI < 0.15). Thus, the emission 
of these galaxies is dominated by the emission orig- 
inating from regions of substantial star formation. 
This sample is also limited to redshifts < 1.3. 

high-z galaxies: Galaxies (not point-sources) 
with redshifts beyond z — 1.3. 



5.2. Star candidates 

In order to identify star candidates in the VLA- 
COSMOS matched radio source sa mple, we m a ke us e 
of the COSMOS stellar catalog (|Tasca et alJ l2007h . 
that was constructed from the HST/ACS catalog 
(|Leauthaud et al.ll2007f) using stellar templates to fit the 
entire SED of each source. In Fig. [6] we show the color- 
color distribution for ~ 2, 000 objects in the COSMOS 
field securely classified as stars (with photometric errors 
better than 0.05), which form well defined loci in the 
broad-band color-color diagrams. Cross-correlating our 
matched radio sample with the COSMOS stellar catalog 
yields only 2 objects detected in the radio regime that are 
consistent with having stellar properties. The color prop- 
erties of these objects are shown in Fig. [6l Within the 
error-bars they are consistent with the main stellar loci. 
Note, however, the r + -i + color excess of one of the star 
candidates in the r + -i + vs. g + -r + color-color diagram 
(middle panel in Fig. [5]) , which suggests co nsistency with 
properties o f e.g. cataclysmic variables (e.g. lSzkodv et al.l 
l2002t [20031 ). or unresolved binary star sy stems contain- 
ing a white dwarf and a late type star (e.g. ISmolcic et al.1 
l2004h . The best fit stellar templates for these objects 
were taken from the PHOENIX library (jHauschildt et all 
Il997f l and represent dwarfs with effective temperatures in 
the range of 4100 to 5000 K and log (g) in the range of 3 
to 3.5. The 1.4 GHz total flux densities for these two ob- 
jects are 126 and 152 /xJy, and the corresponding i band 
AB magnitudes are 25.34 and 23.28, respectively. It is 
noteworthy that both objects have IRAC counterparts, 
but no associated X-ray emission. We consider these two 
sources to be star candidates, however a more detailed 
analysis (using for example spectroscopy), which is be- 
yond the scope of this paper, would be needed to verify 
this. Such a low fraction of identified stars is consistent 
with star detection ra tes in other deep radio surveys (e.g. 
iFomalont et~aLll2006f) . 

The two star candidates in our radio sample form only 
~ 0.1% of the VLA-COSMOS radio sources, and we ex- 
clude them from our sample for further analysis. 

5.3. Quasi stellar objects 
5.3.1. Identification based on morphology 

In order to identify QSOs in our matched radio sam- 
ple we rely on an optical classification, rather than us- 
ing X-ray emission, due to the much higher sensitivity 
of the observations in the optical (5a sensitivity limi t 
in the Subaru i band is 26.2; see iCapak et al1l200l . 
For example, if one would select AGN relying purely on 
e.g. X-ray - to - optical flux ratios, which are generally 
grea ter than 0.1 for both broad and narrow line AGN 
(e.g. iMaccacaro et alJll988t lAlexander et al.ll2001l ). with 
our optical limit of i — 26 (corresponding to r + of ~ 27) 
the depth of the X-ray point-source detection would have 
to be about 2 orders of magnitude deeper than it cur- 
rently is in order to select a complete sample of AGN. 
Further, a clear distinction between broad and narrow 
line AGN would not be possible. Hence, here we identify 
a QSO by requiring that a given source in the matched 
radio sample is optically compact. In Fig. [7] we show 
the fitted i band FWH M of the sources in the COSMOS 
photometric catalog (jCapak et al.ll200"7t ) as a function of 
their i band magnitude. Point-sources (black squares), 
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Fig. 6. — Color-color diagrams for COSMOS stars and VLA- 
COSMOS QSOs (see text for details). The stars (black dots) form a 
narrow, well defined, locus in each diagram. The star symbols (yel- 
low) show 2 objects detected in the VLA-COSMOS radio survey, 
identified as star candidates (see text for details). VLA-COSMOS 
objects classified as QSOs by our selection criteria (blue squares) 
are also shown, and they occupy the 'standard' QSO regions in 
these diagrams (see text for details). The crosses (red) show ob- 
jects classified as a QSO by our technique, yet their spectra were 
identified as red galaxies. QSO evolutionary tracks are shown to 
guide the eye (green curved lines). 

select ed from the HST/ACS catalog (jLeauthaud et all 
l2007f ). form a locus in this plane, separated from the 
area occupied by extended sources. However, the point- 
source locus is fairly scattered, especially at faint mag- 
nitudes, and thus makes a single automatic cut at a cer- 
tain FWHM value inefficient. For this reason, we clas- 
sify sources within the FWHM range of 1.85" — 2.05" as 
QSOs only if their optical HST morphology was visually 
confirmed to be 'point-source dominated'. However, we 
consider all sources below FWHM of 1.85" to be QSOs. 
We further supplement this sample with 12 objects that 
were classified as point-sources in the HST/ACS catalog, 
but do not satisfy the above criteria. 

In summary, out of 1558 objects we identify 139 (i.e. 
11.5%) as QSOs. In Fig. [5] we show their broad-band 
(u* ,g + ,r + ,i + ) color-color properties. As expected, the 
non-stellar emission of the selected objects confines them 
to regions typical for QSOs, which are sep arated from 
the main stellar loci in thes e diagrams (e.g. iBrusa et al.l 
I2007al iRichards et~al| [2002). A minor fraction of these 
objects lie on the stellar loci. However, in the BzK di- 
agnostic diagram, which is an efficient tracer for stars 
( Dad di et al.ll2004r ) these sources are offset from the stel- 
lar locus, verifying their non-stellar nature. 
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Fig. 7. — FWHM of the optical counterparts of the matched 
radio sources (small dots) as a function of i band magnitude , 
taken from the COSMOS photometric catalog (jCapak et aI.II2007I V 
Grey squares represent the HST/ACS point sources in the sam- 
ple. Unresolved sources form a separated locus in this diagram. 
We identify QSOs by requiring that a source is optically compact 
by choosing an absolute separation value of FWHM < 1.85" (be- 
low lower dash-dotted line; light-grey shaded area). In the range 
of FWHM € (1.85,2.05]" (between upper and lower dash-dotted 
lines; dark-grey shaded area) we identify objects as QSOs only if 
they were visually confirmed to be point-source dominated (see 
text for details). 

As AGN dominated systems usually have soft X-ray 
to o ptical flux ratios in th e range of about 0.1 t o 10 
(e.g. iMaccacaro et al.l 119881 : lAlexander et al.l l200lh . we 
can use the X-ray to optical flux properties of the identi- 
fied QSOs to further test our selection criteria. 43 objects 
in our QSO sample were detected as X-ray point-sources, 
and their X-ray to optical flux ratios are consistent with 
the expected values. The median r + magnitude for these 
sources is 21.3. For the remaining QSO candidates, that 
were not detected in the X-rays, the median r + is 24.4. 
Therefore, these sources are also consistent with the ex- 
pected X-ray to optical flux ratios, however beyond our 
X-ray point-source detection limit (10 -15 erg cm~ 2 s _1 
in the soft band). We conclude that the independent 
analysis of the X-ray properties of our selected QSOs 
verifies the validity of our selection. 

5.3.2. Spectroscopic verification 

A sub-sample of 31 objects of the 139 previously identi- 
fied QSOs have available optical sp e ctroscopy with secure 
classifications dTrump et al.l 120071: iPrescott et al.l 120061 : 
iColless et al1l2001l : iSchneider et al.ll2005f ), and only 3 of 
these objects were classified as red galaxies (| Trump et al.l 
|2007|) . while all the others have AGN classifications. The 
3 galaxies classified as ellipticals were identified as QSO 
candidates by our method based on visual/morphologic 
classification, which suggests the presence of dominating 
nuclear emission. The color properties of two of theses 
objects (see red crosses in the top panel in Fig. [6]) are also 
consistent with the color properties of quasars. 21 Thus, 
we conclude that the selected QSO sample is not signif- 

21 For example, in the u*-g + vs. g+-r+ color-color diagram (top 
panel in Fig.O red galaxies would occupy the upper right quadrant 
(see e.g. lStrateva et af|[200ll') . 
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Fig. 8. — HST/ACS stamps (4" on the side; left panels) and the 
corresponding SDSS spectra (smoothed using 20 A wide bins; right 
panels) for two sources that were spectroscopically classified as 
QSOs, but missed by our QSO selection method. The morphology 
of these sources is clearly extended (in particular the source in the 
top panel is possibly a merging system with double nuclei visible, 
and extending beyond the displayed 4"). 

icantly (< 10%) affected by contamination of non-QSO 
objects. 

In order to assess the completeness of the selected 
QSO sample, we search for objects that are spectroscop- 
ically classified as QSOs and 'missed' by our classifica- 
tion method. Our criteria yielded 139 objects classified as 
QSOs in the matched radio sample, and in the remainder 
of the sample (i.e. the 1417 sources that were not classi- 
fied as star candidates or QSOs) spectroscopic classifica- 
tions are available for 397 objects. Out of these, 9 were 
spectroscopically classified as QSOs. Two SDSS exam- 
ples, for which COSMOS HST/ACS imaging is available, 
are shown in Fig. [5] They obviously show extended op- 
tical emission, and a substantial light component arises 
from the host galaxy itself. The median redshift of these 
9 objects is only 0.4. It is noteworthy that all of these ob- 
jects have X-ray point source detections, and all except 
one have X-ray luminosities higher than 10 42 erg s _1 . 
Therefore these galaxies will be selected into our AGN 
class, hence not contaminating the SF galaxy sample (see 
Sec. 15. 4p . As the spectroscopic sub-sample fairly repre- 
sents the full matched radio sample (see Fig. [3]), we con- 
clude, based on the above analysis, that the sample of 
identified QSOs is about 80% complete. As expected, 
the incompleteness is mostly due to relatively low red- 
shift, low-luminosity AGN. 

5.4. Star forming and AGN galaxies 

In the previous sections we have identified 2 star can- 
didates and 139 QSOs in the matched radio sample. We 
will refer to the 1417 remaining sources in the matched 
radio source sample as the 'galaxy sample'. Before ap- 
plying the rest-frame color based classification method 
to our VLA-COSMOS galaxies in order to separate SF 
from AGN galaxies, we restrict the galaxy sample to 941 
galaxies with redshifts < 1.3, as a) the photometric red- 
shifts are less reliable beyond this redshift, and b) the 
library of BC03 model spectra that we use for the SED 
fitting may not be appropriate for fits beyond this red- 
shift as the distribution of priors was set to optimally 
match this intermediate redshift range. Hereafter, we 
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Fig. 9. — The distribution of the synthesized rest-frame PI color, 
corrected for the systematics (see bottom panel in Fig.0, for the 
objects in the matched radio galaxy sample with z < 1.3. Note, 
that the expected distribution from our local sample, showing a 
prominent tail towards bluer values of PI, is reproduced. The 
dash-dotted line indicates our chosen boundary for separating star 
forming from AGN galaxies. 

will call the sample of 476 galaxies with redshifts greater 
than 1.3 high redshift (high-z) galaxies. 

We perform an SED fit using GOSSIP (as described 
in Sec. I4.2[) for each of the 941 objects in the matched 
radio 'galaxy' sample out to z — 1.3. The distribution 
of the rest-frame color PI for these galaxies is shown 
in Fig. [9] The distribution is very similar to that of 
the local sample (see top panel in Fig. with a peak 
at Pl~ 0.4 (AGN) and a prominent tail towards bluer 
values (SF galaxies). We inspected the behavior of the 
median value of the synthesized PI color for the entire 
z < 1.3 galaxy sample as a function of redshift, and we 
found no significant evolution in the median color. We 
reached the same conclusion analyzing the median PI 
colors of the SF and AGN sub-populations. This implies 
that a fixed cut in the color can safely be applied to the 
entire galaxy population out to z = 1.3. 

The SED fitting was performed via a \ 2 minimization 
procedure. The median value of the reduced x 2 of the 
SED fits, computed using the best fit model spectrum, is 
0.6 with an interquartile range of 2. Only 10% of the fit- 
ted objects have reduced \ 2 values above 5, and only 5% 
above 10. A visual inspection suggests that these galaxies 
are either nearby galaxies, which are resolved and often 
saturated in the Subaru i band, or QSO contaminants. 
While the latter predominantly have blue PI colors, the 
synthesized PI color for the first class of galaxies still 
appears to be a valid tracer for the SF/AGN separation, 
and therefore we do not reject them from the sample. 
In order to select SF and AGN galaxies we require that 
the synthesized PI color is < 0.15 and > 0.15, respec- 
tively. However, to improve our selection at this point 
we make use of the X-ray properties in the soft band 
of the 114 galaxies that were detected as X-ray point 
sources. Namely, if the soft X-ray luminosity of an object 
is greater than 10 42 erg s _1 we consider it to be an AGN, 
regardless of its PI color. Note that this criterion is ex- 
pected to reduce the contamination of the SF sample by 
objects with blue rest-frame colors, such as QSOs missed 
by our selection. Out of the 114 X-ray detected sources, 
77 have X-ray luminosities greater than the above given 
value, and 37 out of these 77 have a synthetic PI color 
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< 0.15. In summary, our selection yields 340 SF and 601 
AGN galaxies in our matched radio galaxy sample with 
z < 1.3. We analyze these galaxies further in Sec. [7] 

6. COMPARISON WITH OTHER SELECTION METHODS 

In this Section we compare our classification method, 
that we have applied to an intermediate redshift popu- 
lation, with other classification methods used for both 
local an d intermediate redshift populations in the lit- 
eratur e (jLacv et al l l2004t IStern et all l2005t iBest et all 
l2005al) . We also study the 24/jm properties of our radio 
sources, and their correlation to the 1.4 GHz emission. 

6.1. 3.6-8 [xra color - color diagnostics 

QSOs, whose UV to NIR continuum is dominated by 
a power law, tend to be redder than other types of 
galaxies in the MIR. Hence, they occupy a distinct re- 
gion in MIR color space, and several color-color crite- 
ria were su ggeste d for their selection (|Lacv et al.l 12004 
IStern et all 12005 h In Fig. Q1 we compare our classifi- 
cation method with those proposed in the MIR using a 
sub-sample of the matched radio sources that were also 
detected with IRAC (~ 90% have IRAC counterparts; 
see Sec. IP]) . We indicate the QSO (dots), AGN (thin 
contours) and star forming (thick contours) galaxies se- 
lected using our method. The dashed lines in the top 
and bottom panel in Fig. ITOl show th e color-color criteri a 



proposed by lLacv et all (|2004h and IStern et~aT1 (|2005f) . 



respectively, for the selection of broad-line AGN. As ex- 
pected, the majority of objects selected as QSOs by our 
method falls within this region, reassuring the efficiency 
of the classification method presented here. There are 
several QSO candidates outside these regions, which is 
not surprising as the suggested 'quasar regions' do not 
select a 100% complete sample of QSOs, and a cert ain 
amount of outliers is expected ( see IStern et al.H2005l for 
a discussion of this point). In Sec. 15.31 we have inferred 
that our selected sample of QSOs is not significantly con- 
taminated by different types of objects, which is affirmed 
b y this independent analysis. 

IStern et alJ (|2005l ) showed that at redshifts of < 1 
galaxies span a large range in the 7715.8 — fns.o color, 
which is consistent with the horizontal extent of our se- 
lected star forming and AGN galaxies (see bottom panel 
in Fig. fT0|) . However, typical low luminosity AGN and 
starburst galaxies cannot be clearly divided using these 
diagnostic diagrams (see also color evolutionary tracks 
in Fig. [T9|). Nonetheless, elliptical galaxies (which corre- 
spond to our class of absorption line AGN) tend to oc- 
cupy the bottom left regions in both diagrams, and close 
to these regions the distributions of our identified AGNs 
peak. On the other hand, the peak of the distribution 
of our selected star forming galaxies in these diagrams is 
clearly displaced from the one for AGN. We would like to 
stress that this independently confirms that indeed two 
different populations of galaxies have been selected. The 
differences in the MIR properties of our SF and AGN 
galaxies affirm the strength of the rest-frame color based 
classificati on method. 

Further, St ern et "all (|2005f ) showed that narrow line 
AGN appear spread out in both the QSO and galaxy 
regions, which is also a result of our selection method 
[note that the selected AGN are present in both regions] . 
The last point we want to stress is that the star forming 
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Fig. 10. — IRAC color-color diagrams in two representations (top 
and bottom panel) used as diagnostic tools for the separation of 
QSOs (broad line AGN) from 'normal' galaxies (Lacy ct al. 2004; 
IStern et all l2005h . Filled circles display the 122 QSOs selected 
by our morphologic method that have IRAC detections; thin and 
thick contours show the 579 AGN and 322 star forming galaxies 
separated by our rest-frame color based classification method, re- 
spectively, that have IRAC counterparts. The contour levels in 
both panels are in steps of 7 starting at 7. The dashed lines in 
the top and bottom panels show the empir ically selected reg ions 
for identify ing broad-line AG N proposed by Lacy et al. ( 20041. top 
panel) and Stern ct al. (2005, bottom panel), respectively. 

galaxy locus in these diagrams is also consistent with 
the expected colors, as a 'contamination' by star forming 
galaxies of the QSO locus is expected, especially close to 
the boundary. In summary, the classification method 
presented here agrees remarkably well with the expected 
properties of QSOs, AGN and star forming galaxies at 
intermediate redshifts in the MIR range encompassing 
3.6-8 /im. 

6.2. 24 [irn 'properties: The 24 fJ- m ~ radio correlation 

A tight mid-IR (as well as far- and total- IR) - radio 
correlation is expected for star forming galaxies, while 
'radio-l oud' AGN are expected to strongly deviate from 
it (e.g. ICondodfl99i lBeill2003t lAppleton et 1012001 . 
The 60 fim - r adio correlat i on fo r low-luminosity AGN 
was studied by I O brie et al.l (|2006f ) in the local universe. 
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Based on a selection utilizing the BPT diagram they have 
shown that also low -luminosity AGN follow a tight FIR 

- radio correlation, however with a slightly different slope 
and a larger scatter than SF galaxies. In this section we 
investigate the 24 /xm - radio correlation for our selected 
SF and AGN galaxies. In particular, if our SF/AGN 
separation method is successful, then a difference in the 
24 /jm compared to 20 cm properties is expected to be 
seen for the two populations. 

Our rest-frame color based classification method has 
identified 340 SF galaxies. Out of these 82% (280) were 
detected at 24 /xm with a signal to noise > 3. On the 
other hand, out of 601 selected AGN only 44% (267) 
have a MIPS 24 /xm detection with S/N > 3. In Fig. [TT] 
we show the 24 /im vs. 1.4 GHz luminosity (top panel) 
for our SF and AGN galaxies, where the 24 jum data was 
not k-corrected. A correlation between the two luminosi- 
ties exists for both types of objects detected at 24 /xm, 
although on average for a given Li.4ghz the 24 /jm lu- 
minosity is slightly lower for AGN than for SF galaxies 
(see also below) . For the SF and AGN galaxies that were 
not detected at 24 /im we have computed upper limits of 
the 24 /xm luminosity using the detection limit of the S- 
COSMOS MIPS shallow survey which is 0.3 mJy. These 
limiting luminosities are also shown in Fig. [TT] Note 
that for AGN galaxies, as 56% of them are not detected 
at 24 /xm, the scatter in the correlation is significantly 
increased by these objects. 

To quantify t he correlation, we derive the classical q 
parameter (e.g. ICondonl I1992T ) as the logarithm of the 
24 /xm to 1.4 GHz observed flux ratios. This parameter 
essentially measures the slope of the correlation, and in 
the bottom panel in Fig. [TTJ we show it as a function of 
Li.4GHz for our SF and AGN galaxies, with indicated up- 
per limits (derived as described above). The q parameter 
seems to show a decreasing trend with increasing radio 
luminosity. However, this trend is dominated by the ob- 
jects that have only estimated upper limits, and therefore 
may be mimicked by the flux limits of the samples. A 
more detailed analysis of this issue is beyond the scope 
of this paper. 

The distribution of the q parameter for SF and AGN 
galaxies is shown in Fig. [T2J for galaxies detected at 
24 /xm and those which only have upper limits. The 
median q value for SF galaxies is 0.82 ± 0.05 with a scat- 
ter of ~ 0.3 when all objects (also the upper limits) are 
taken into account. On the other hand, the median q 
value for the AGN galaxies is 0.51 ± 0.02, significantly 
lower than for SF galaxies. We also find a larger spread 
in q (~ 0.4) for the AGN population. Note, however, 
that the spread quoted here should be considered some- 
how tentative, especially for AGN, as the exact q values 
for the fraction of objects not detected at 24 /xm are not 
known. Nonetheless, this does not affect the estimates 
of the median values. Our q parameter derived for SF 
gal axies is remarkably co nsistent with the one inferred 
bv lAppleton et all (|2004h at 24 /xm. Combining Spitzer 

- MIPS and VLA observations in the First Look Sur- 
vey with optical spectroscopy, they have found a q value 
of 0.84 with a spread of 0.28 (with no k-corrections ap- 
plied). They have also shown that the FIR luminosities 
of AGN tend to be lower for a given radio luminosity, 
consistent with our findings here. 

Finally, in Fig. [13] we show the q parameter as a func- 
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Fig. 11. — Top panel: 24 (im luminosity vs. 1.4 GHz luminosity 
for our selected SF (grey squares) and AGN (black dots) galaxies. 
Upper 24 fim luminosity limits for the radio SF (18%) and AGN 
(56%) galaxies that were not detected at 24/im are indicated by 
grey and black arrows, respectively. Bottom panel: The q param- 
eter, denned as the logarithm of the 24 fim to 20 cm flux ratio, 
as a function of 20 cm luminosity for SF (grey squares) and AGN 
(black dots) galaxies. Upper limits are indicated by arrows, as in 
the top panel. Note the much larger scatter in q for AGN than for 
SF galaxies. 



z 30 



z 40 



" SF 

<q> = 0.82 










agn r 

<q> = 0.51 1 
r - J 









-10 12 

q 



Fig. 12. — The distribution of the q parameter (see also Fig. lilt 
for SF (top panel) and AGN (bottom panel) galaxies. In both 
panels the solid histograms show the distribution of the 1.4 GHz 
sources detected at 24 fim while the dash-dotted histograms repre- 
sent the distribution of the upper limits of q obtained for the radio 
sources that were not detected at 24 fim (see text for details). The 
solid vertical line in each panel designates the median value of the 
entire distribution, also listed in the top left in each panel. 
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Fig. 13. — q as a function of redshift for SF (grey squares) 
and AGN (black dots) galaxies. Note that q does not depend on 
redshift, implying that the 24 /im- radio correlation holds out to 
high redshift (see text for details). 

tion of redshift for our SF and AGN galaxies, q does 
not depend on redshift, both for SF and AGN galax- 
ies, implying that the MIR - radio correlation with the 
same slope is valid out to high redshifts (z ~ 1.3). 
This result is again c onsistent with those presented in 
lAppleton et~ai1 (|2004f) . 

In summary, the above results have shown that the 
24 /um - radio correlation has different properties for 
our selected SF and AGN galaxies, which verifies the 
efficiency of our rest-frame color based classification 
method. 

6.3. Selection based on optical spectroscopic properties, 
radio luminosity and stellar mass 

iBest et all (|2005aD defined a sample of ~ 3000 local 
(0.01 < z < 0.3) galaxies from the SDSS DR2 "main" 
spectroscopic sample matched with sources above 5 mJy 
detected in the NVSS survey. They further divided the 
radio - optical source sample into AGN and SF galaxies 
making use of the galaxies' location in the plane spanned 
by the 4000 A break [D n (4000)] and radio luminosity 
[Li.4GHz] normalized by stellar mass [M*]. In Fig. [T4lwc 
compare our rest-frame c olor based classifica tion method 
with the one utilized bv lBest et al.l (|2005al ) in the local 
universe. The D n (4000) vs. log (Li.4GHz/M*) distribu- 
tion for all galaxies in the matched radio source sample 
is shown in Fig. Q31 The 1.4 GHz luminosity for these 
galaxies was derived, and D n (4000) and M» from the 
best fit template from the SED fitting (see Sec. I4.2.T]) . 
The average errors are indicated. The dashed line cor- 
responds to the sep a ration between SF and AGN pro- 
posed bv lBest et al.l (|2005aD . and the two types of sym- 
bols designate the SF (squares) and AGN (dots) galax- 
ies identified by our rest-frame color based classifica- 
tion method. We want to note that Best et al. cali- 
brated their separation method using a slightly differ- 
ent selection of objects in the BPT diagram (see their 
Fig. 9) with respect to the one we use here. Therefore, 
a perfect correspondence between our and the Best et 
al. method is not to be expected, even if our derived 
quantities were absolutely accurate. The area in the 
D n (4000) vs. log (Li.4GHz/M*) plane where the major 
disagreement is expected, due to the different selections 
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Fig. 14.— The 4000 A break, D n (4000), vs. 1.4 GHz radio 
luminosity normalized by stellar mass, Li .4GHz/M„ for galaxies 
in the matched radio sample. M* and D n (4000) were synthesized 
via SED fitting by GOSSIP. The black and grey symbols show ob- 
jects classified by our rest-frame color based classification method 
as AGN and star forming galaxies, respect ively. The dashed line 
corresponds to the separation proposed by Best et al. (2005a) for 
local galaxies. The average errors of the synthesized quantities are 
shown in the top right. A good consistency between the two selec- 
tion methods exists, given the large error bars, as well as a slightly 
different selection rational used here and in Best et al. (2005a; see 
text for details). 

in the BPT diagram, is in the range of 1.4 <D n (4000) < 
1.6, and 11 < log (Li.4ghz/M*) < 12. This is the region 
where a larger fraction of Seyfert and LI NER galaxies is 
located (see Fig. 9 in IBest et ail 12005a!) , and, different 
from Best et al., we define these galaxies exclusively as 
AGN. In this region in Fig. Q3]we indeed see the largest 
disagreement between the two classifications. Further, 
the existence of objects with D n (4000) < 1.3 that we clas- 
sify as AGN is not surprising, but it rather reflects the 
dual properties of composite objects, which in this case 
were classified as AGN by our rest-frame color based clas- 
sification method. We also want to note that the average 
error in the synthesized D n (4000) [derived from compari- 
son with the spectroscopic and synthetic D n (4000) in the 
local sample] is fairly large, and thus prevents a more 
detailed comparison between the two selection methods. 
Overall, given the error bars and the difference in the 
basic selections of the two methods, as well as the fact 
that our D n (4000) values are not spectral measurements 
on the data, but values taken from the best fit template, 
we conclude that our rest-frame color based cl assification 
method agrees well with the one proposed by IBest et all 
(20053) in the local universe. 

In summary, our rest-frame color based classification 
method for separating SF from AGN galaxies agrees well 
with other selection schemes, proposed in the literature, 
which are based both on MIR colors and optical spectro- 
scopic diagnostics. 



7. 



DISCUSSION: THE COMPOSITION OF THE FAINT 
RADIO POPULATION 



In previous Sections we have presented, tested, and 
discussed in detail the photometric classification method 
which we used to separate the matched radio source 
sample into stars, QSOs, star forming, AGN and high- 
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z galaxies. In this Section we discuss the properties of 
the 'population mix' in the VLA-COSMOS survey: In 
Sec. l7.ll we describe the redshift and luminosity distribu- 
tions of the selected SF and AGN galaxies, and in Sec. 17. 21 
we study the contribution of different source types to 
the sub-mjy radio population. We show, based on the 
matched radio sample, as well as on the remaining ra- 
dio sources with no optical counterparts (brighter than 
i = 26), that star forming galaxies do not dominate the 
sub-mjy sources, but that the majority of these sources 
is rather comprised of AGN and QSOs. 

7.1. The redshifts and luminosity distributions of SF 
and AGN galaxies out to z = 1.3 

The rest-frame color based classification method 
yielded 340 star forming and 601 AGN galaxies out to 
redshifts of 1.3. In the top panel in Fig. [15] we show 
the redshift distribution for these galaxies using red- 
shift bins of 0.217 in width. We use such wide red- 
shift bins to assess the average properties of the radio 
population, reducing the effects of fluctuations due to 
the strong and narrow overdensit ies which are known 
to exist in the COSMOS field (fScoyille et all l2007al 
iFinoguenov et all l2007t ISmolcic et all l2007bf ). Poisson 
errors are indicated for each bin. The deficit of galax- 
ies at the low-redshift end reflects the relatively small 
comoving volume sampled by the 2D° area of the COS- 
MOS field at these redshifts. The decline in the number 
of sources at the high-redshift end, on the other hand, 
reflects the detection limit of the VLA-COSMOS sur- 
vey. The redshift distribution of the number of star 
forming galaxies seems to be more uniform than the 
one for AGN, in particular the relative number of star 
forming galaxies compared to AGN rises at higher red- 
shifts (z ~ 1). This may be explained by the relatively 
high n umber density of ULIRGs expected at th ese red- 
shifts ()Le Floc'h et all 120051 : ICaputi et al1l2007h in con- 
junction with the VLA-COSMOS detection limit which 
at these redshifts allows to sample only radio luminosi- 
ties larger than 3 x 10 23 WHz -1 (see Fig. [16] below). 
Further, as the comoving volume surveyed at z ~ 1 is 
larger than the one surveyed locally, the probability of 
detecting a ULIRG is also increasing at these redshifts. 
Effects of cosmic variance as a function of redshift can- 
not be excluded, however they should be smaller than 
for other deep radio surveys that typically probe signifi- 
cantly smaller areas. An increase of the AGN fraction at 
z ~ 0.7 is discernible. It possibly arises due to the dense 
large scale st ructure component in the COSMOS field at 
this redshift (|Scoville et al.ll2007bl : iGuzzo et al.ll2007f) . 

In the bottom panel in Fig. Qjj] we show the fractional 
contribution of the SF and AGN galaxies to the z < 1.3 
matched radio population, as a function of redshift. On 
average, we find that the mean fractional contribution of 
SF and AGN galaxies to the z < 1.3 matched radio pop- 
ulation is (34 ±14)% and (66 ± 14)%, respectively. This 
is strikingly similar to the relative numbers of SF and 
AGN galaxies in the local Universe. Namely, if we ap- 
ply the adopted PI color cut to the SDSS/NVSS galaxy 
sample (see Sec. 14. ip . we find that ~ 32% of the galaxies 
are star forming, and ~ 68% are AGN. If, as shown by 
the tests described in Sec. l4.1[ our rest-frame color based 
classification method can reliably be applied also to high 
redshift galaxies, then the similarity of the SF and AGN 
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Fig. 15. — Top: Redshift distribution of i) the galaxies in 
the matched radio sample out to z = 1.3 (thick line), ii) the 340 
selected star forming galaxies (light-grey shaded histogram) and 
iii) 601 identified AGN (dark-grey shaded histogram). Bottom: 
The fractional distribution of the SF and AGN galaxies compared 
to the total number of galaxies in the matched radio sample out to 
z = 1.3 as a function of redshift. In both panels Poisson errors are 
indicated. 



fractions suggests that the two populations have similar 
evolutionary properties out to z ~ 1.3. 

In Fig. [TBI we show the 1.4 GHz luminosity as a func- 
tion of redshift for the selected SF and AGN galaxies 
out to z = 1.3. We also indicate the expected luminosity 
ranges for Milky Way type galaxies, LIRGs, ULIRGs and 
HyLIRGs, which were derived using the total IR - radio 
correlation (Bell 2003). It is noteworthy that the ma- 
jority of galaxies with luminosities typical for HyLIRGs 
(Lir > 10 13 L©) was classified by the rest-frame color 
based classification method as AGN, cons istent with the 
expected properties of these galaxies (e.g. IVeilleux et all 
1999; Tran et al. 2001). This point is seen more clearly 
in Fig. [T7l where we show the distribution of the 1.4 GHz 
luminosity for the selected star forming and AGN galax- 
ies. The median luminosities are ~ 1.6 x 10 23 WHz -1 
and - 3.2 x 10 23 WHz" 1 for SF and AGN galaxies, 
respectively. Although the median luminosities of the 
two populations are different only by a factor of 2 (note 
that this is drawn from a luminosity distribution of a 
flux limited sample), there are some significant differ- 
ences at both high and low radio luminosity. At high 
luminosities there is the strong decline of the number 
of SF galaxies with luminosities above ~ 10 24 WHz -1 , 
while AGN show an extended tail towards the bright- 
est 1.4 GHz luminosities. Such a behavior is consis- 
tent with results from local studies, which suggested that 
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Fig. 16. — 1.4 GHz luminosity as a function of redshift for the 340 
selected star forming galaxies (grey squares) and 601 AGN (black 
dots). The horizontal (dot-dashed) lines correspond to 1.4 GHz 
luminosities typical for various classes of galaxies, obtained using 
the total IR — radio correlation (Bell 2003). The solid curved line 
corresponds to the VLA-COSMOS 5<x limit of ~ 50 fijy. Note also 
that the VLA-COSMOS survey is sampling the entire LIRG and 
ULIRG populations out to redshifts of ~ 0.4 and ~ 1, respectively. 

'nor mal' 22 galaxies tend to have Li.4gh z < 10 24 WHz -1 
(e.g. ICondonl fl992). It is noteworthy, that our SF and 
AGN galaxies were identified completely independently 
from their radio luminosity, yet their 1.4 GHz luminosi- 
ties match the expectations based on local studies. At 
low luminosity (< 2 x 10 22 WHz -1 ; below the typical 
LIRG radio luminosity) the fraction of SF galaxies in- 
creases and the numbers of SF and AGN galaxies are 
similar to each other. 

Further, the luminosity distribution shown in Fig. [T7] 
agrees well with local results, which have shown that 
Li.4GHz for star forming and (absorption and emission 
line) AGN shows overlapping distr ibutions, and con- 
sequently no clear separation (e.g. 



Sadler et alj 119991 : 

I Jackson fc Londish 2000; C han et al.ll2004l ). In the local 
universe I Sadler et alJ ([1999) inferred a median Li 4GHz 
for SF and AGN galaxies to be - 3 x 10 22 WHz -1 and 
~ 3 x 10 23 WHz -1 , respectively. Hence, our median 
value for the luminosity of VLA-COSMOS AGN (3.2 x 
10 23 WHz -1 ) out to z = 1.3 matches the one inferred 
locally, however for SF galaxi es (1.6 x 10 23 WHz -1 ) it 
is higher than that derived bv ISadler et al.l (|1999f ). The 
latter is easily understood as the combined effect of the 
higher median redshift (~ 0.7) of the galaxies in our flux 
limited sample (thus not probing low Li.4GHz) and of the 
higher level of star formatio n activity, which is observed 
going from redshift to 1 ( Ma dau et al.l [19961: iHonkind 
2004) also implying higher Li.4gh~z (|CondoiJll992t iBell 



20Q3J ) . 

We caution at this point that luminosity distributions 
of flux limited samples are strongly dependent on the 
detection limits, and their interpretation has to be ap- 
proached carefully. For this reason, in Fig. [17] we also 
show the luminosity distributions for volume limited 
samples of our selected SF and AGN galaxies in three 

22 'Normal' galaxies, in terms of radio properties, are broadly 
denned as galaxies whose radio emission is not powered by a super- 
massive black hole. These galaxies are a mix of spiral, dwarf ir- 
regular galaxies, peculiar and interacting s ystems, as wel l as E/S0 
galaxies with ongoing star formation (see Condon 1992 for a re- 
view) . 
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Fig. 17. — Distribution of the logarithm of the 1.4 GHz lumi- 
nosity for the selected star forming (light-grey shaded histograms) 
and AGN (dark-grey shaded histograms) galaxies for the entire 
flux limited sample (top panel), and for volume limited samples in 
3 redshift ranges (bottom 3 panels). In the top panel the median 
log(Li.4GHz) values for the full star forming and AGN samples, 
and the average error in log (Li ,4GHz) are indicated. The redshift 
and luminosity ranges of the volume limited samples are given in 
the bottom 3 panels. 



redshift ranges. On average AGN galaxies have higher 
1.4 GHz luminosities than SF galaxies, reassuring the va- 
lidity of our selection method. Note, however, that in the 
lowest redshift range the AGN and SF galaxy distribu- 
tions are very similar. This is not surprising, but rather 
consistent with the observed luminosity range, that en- 
compasses the region of indistinguishable space density 
of SF and AGN galaxies in the local universe (see local 
1.4 GHz lumin osity functions for SF and AGN galaxies; 
e.g. lBesgi200l . 

7.2. The 'population mix' in the VLA-COSMOS survey 

In this Section we study the contribution of different 
sub-populations to the the total sub-mJy radio popula- 
tion. The key question we want to answer is: Is the 
sub-mJy population dominated by any particular sub- 
population, which may be the main cause for the ob- 
served flattening of the differential radio source counts 
below 1 mJy (fo r VLA-COSMOS source counts; see 
IBondi et aIJl2007h ? 

7.2.1. Is the matched radio sample at sub-mJy levels 
dominated by star forming galaxies? 

In order to obtain an insight into the 'population 
mix' of the faint radio sources in the matched radio 
source sample, in Fig. [15] we show the distribution of 
the 1.4 GHz total flux density, F to t, for the SF and AGN 
galaxies in the matched radio sample out to z = 1.3, 
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as well as for the identified QSOs. 23 Note, that the re- 
maining z > 1.3 galaxies in the matched radio sample 
are defined as high redshift (high-z) galaxies. The flux 
bins in Fig. [18] are 0.15 mJy wide. Such wide bins al- 
low us to study the average behavior of the galaxies in 
the sub-mJy population with decreasing flux densities. 
Our main aim is to answer one of the more controversial 
questions in radio astronomy: Is the sub-mJy radio pop- 
ulation dominated by star forming galaxies or any other 
distinct sub-population? 

From the top panel in Fig. [18] it is obvious that at 
flux densities above ~ 0.7 mJy we are hampered by low 
number statistics (the total number of sources in each 
bin is below 20). Therefore, for the purpose of this 
paper we will focus only on sources with flux densities 
below 0.7 mJy down to the VLA-COSMOS 5ct limit of 
- 50 ^Jy. 

The middle panel in Fig. [TBI shows the fractional distri- 
bution of the identified populations in the matched radio 
sample, with indicated Poisson errors. Our findings are 
as follows. QSOs contribute to the matched radio sample 
at a constant level of about 10%. AGN galaxies below 
z = 1.3 show a decreasing trend with decreasing flux 
densities, with their contribution to the matched radio 
sample dropping from ~ 60% to less than 40%. The SF 
galaxies at z < 1.3 in the flux density range of 50 /xJy 
to 0.7 mJy contribute fairly constantly at the given flux 
densities, with an average contribution of about 20%. 
Note that the possible increment from 0.7 mJy to 50 /xJy 
of only ~ 10% is not significant. In Sec. 14.2.21 we have 
inferred that the photometric errors in the synthesized 
PI color introduce a positive bias of ~ 5% for SF galax- 
ies. As this bin contains the lowest number of SF galax- 
ies, it is the most affected by this bias. In Sec. 14.1.21 
based on the local SDSS/NVSS sample of galaxies, we 
have shown that our rest-frame color based classification 
method selects ~ 70% of 'real' SF galaxies, which make- 
up ~ 85% of the complete sample of SF galaxies, while 
the photometrically selected AGN sample is contami- 
nated by SF galaxies at the 5% level. Assuming that the 
percentages of completeness and contamination, derived 
from the analysis of the SDSS/NVSS sample of galaxies, 
can be safely applied also to our VLA-COSMOS sam- 
ple, we can use them to correct the observed fractions. 
Even with the correction the fractional contribution of 
SF galaxies at z < 1.3 in the matched radio popula- 
tion essentially stays the same, i.e. about ~ 20% at flux 
densities in the range fr om 50 /xJy to 0.7 mJy- Contrary 
to pr evious studies (e.g. lSevmour et a"i1l2004tlBenn et al.1 
[1991 . our results show that SF galaxies at intermediate 
rcdshifts are not the dominant population at sub-mJy 
flux density levels. However, it may be possible that a 
significant number of SF galaxies at z > 1.3 exists, which 
may contribute strongly to the sub-mJy population. We 
investigate this possibility in the bottom panel in Fig.fLSl 
and in Fig. [T9] and show below that this is not the case. 

About 30% of the matched radio sample consists of 
galaxies at redshifts beyond z — 1.3 (high-z; see bot- 
tom panel in Fig. [T8"]). The contribution of these galax- 
ies marginally increases from 20% at 0.7 mJy to 35% at 
50 /iJy. Although we were not able to apply our rest- 

23 Photometric redshift information for QSOs in the COSMOS 
survey is not available at this point. 
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Fig. 18.— Top: The distribution of the 1.4 GHz total flux for 
the star forming (vertically hatched histogram) and AGN galaxies 
(light-grey shaded histog ram ) in the matched radio sample out 
to z = 1.3 (see also Fig. 11711 . Shown is also the distribution for 
the selected QSOs (dark-grey shaded histogram), for which good 
redshift information is not available at this point. 'MRS' stands 
for the 'matched radio sample'. Middle: The relative contribution 
of SF, AGN galaxies out to z = 1.3, and QSOs to the matched 
radio sample. Note that the missing fraction of sources consists of 
the high redshift (high-z) objects, which we define as galaxies in 
our matched radio sample with z > 1.3. Bottom: The cumulative 
distribution of SF, AGN galaxies out to z = 1.3, QSOs and high-z 
galaxies in the matched radio sample. The indicated error bars in 
the middle and bottom panels are derived from Poisson statistics. 
Note also the different flux scales in the top and the middle/bottom 
panels. 



frame color based classification method to these galax- 
ies in order to classify them, we can nevertheless draw 
some conclusion about their nature by studying their 
multi-wavelength properties. Therefore, in Fig. [19] we 
plot the MIR diagnostic diagram, for the high-z galax- 
ies and compare their properties to the classified sources 
in the matched radio sample. We overlay non-evolving 
color tracks for the typical starburst galaxy M82, an el- 
liptical 13 Gyr old galaxy, and a Seyfert-2 type SED, ob- 
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Fig. 19. — MIR color - color digram (analogous to Fig. 1101 
, bottom panel) for the full matched radio sample. Gray dots 
represent sources classified using our classification method (i.e. SF 
and AGN galaxies out to z = 1.3, and QSOs), while the black 
dots show the high-z sources (galaxies in the matched radio sample 
beyond z = 1.3). Open black squares represent the high-z sources 
that have an XMM point source counterpart. The dashed lines 
are indicated to guide the e ye, and correspond to the AGN type-1 
selection region, proposed by fStern et al.j j2005D . The curved green, 
blue, and red lines correspond to the color-color tracks, obtained 
from SEDs of the starburst galaxy M 82, a composite of 28 Seyfert 2 
galaxies, and a 13 Gyr old elliptical galaxy, respectively, in the 
redshift range from 0.1 to 2.5. 

tained from a co mposite spectrum of 28 randomly chosen 
Seyfert galaxies (|Polletta et al.| [2006). As expected, the 
properties of high-z galaxies are consistent with proper- 
ties of higher redshift (z > 1.5) galaxies. However, the 
region they occupy in this MIR color-color diagram is 
occupied in a similar way by different sub-populations, 
such as star forming, Seyfert- type, and passively evolv- 
ing galaxies, in the redshift range from about 1.5 to 3 
(see color tracks in Fig. [T9l . Further, it is possible that a 
fraction of these sources are broad line AGN as they are 
located within th e area outlined by dashed lines, which 
was proposed bv lStern et al.1 (120051) for th e selection of 
AGN Type-1 fsee also lCaputi et al l [20071 ). This is fur- 
ther strengthened by the 22 high-z sources which were 
also identified as X-ray point sources, and lie within the 
Type-1 AGN selection region. Further, given the high 
redshift of these sources, combined with the XMM detec- 
tion limit, it is highly unlikely that any of these sources 
may be star forming (see e.g. Fig. 14 in iTrump et al.l 
2007). This analysis alone already indicates that the 
matched radio sources beyond z — 1.3 are most prob- 
ably a mixture of different sub-populations. 

The population mix in the high-z sample is further af- 
firmed by the distribution of the high-z galaxies in the 
BzK diagram (shown in Fig. I20j) . which is commonly 
utilized for the selection of z > 1.4 galaxies, and sep- 
arates well passively-evolving galaxies from those origi- 
nally defined as 'star forming' at z > 1.4 (Dad dTet al.l 
12004) . Form Fig. fZU\ it becomes obvious that the BzK 
criterion does not select a pure 'star forming' sample at 
z > 1.4, as initially postulated, but a sample that is com- 
prised of both SF galaxies and low-luminosity AGN. In 
addition, the X-ray detected high-z sources, which may 
be classified as AGN with high confidence as discussed 
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Fig. 20. — B — z vs. z — K color-color diagram for the entire 
matched radio sample. The symbols are analogous to Fig. 1191 Gray 
dots represent sources identified using our classification method 
(i.e. SF and AGN galaxies out to z = 1.3, and QSOs), the black 
dots show the high-z sources, and open squares show XMM de- 
tected point sources in the high-z sample. The dashed lines sep- 
arate regions of passively evolving (top right; outlined by the di- 
agonal and horizontal dashed lines) and star-forming gala xies (left 
of the diagonal dashed line) at z > 1.4, adopted from Daddi et al. 
[|200lh . The curved green and blue lines correspond to the color- 
color tracks, obtained from SEDs of the starburst galaxy M 82, and 
a Seyfert 2 composite, respectively, in the redshift range from 0.1 to 
2.5. Note, that Seyfert type of galaxies at z > 1.4 are also present 
in the region of 'st ar-forming galaxies', as initially postulated by 
IDaddi et all | |2U04T1 . 

above, also lie within this region. Therefore, the SED 
color tracks combined with the distribution of the high-z 
galaxies (black dots in Fig. H20"|) , and the X-ray detected 
subsample suggest that our z > 1.3 matched sources are 
a fair mixture of SF and AGN galaxies, yielding that 
the high-z galaxies continue to consist of different galaxy 
populations at higher redshifts. 

For reference, the i band magnitude distribution for 
the identified sub-samples in the matched radio sample 
is shown in Fig. [5T] Note that the high-z galaxies are 
the faintest optical sources, consistent with the expecta- 
tions for high redshift sources drawn from a flux limited 
sample. However, we want to mention that even in the 
most extreme case that all of the high-z galaxies were star 
forming, still the fractional contribution of star forming 
galaxies to the matched radio source population would 
not dominate over the contribution of AGN and QSOs, 
but the two contributions would rather be comparable. 

As shown in Fig. [20] it is likely that both classes of ob- 
jects contribute similarly to the high-z galaxies, although 
their exact relative fractions cannot be determined from 
this analysis. However, if we assume that roughly 50% 
of the high-z galaxies are SF galaxies, then the fraction 
of SF galaxies in the matched radio sample would in- 
crease to less than 40%. Therefore, we conclude that the 
population of star forming galaxies is not the dominant 
population at sub-mJy levels, at least in our sample of 
radio sources with optical counterparts out to i = 26. 

Our results are s trikin gly similar to the recent results 
bv lPadovani et all (|2007h . who studied the sub-mJy radio 
source population in the Chandra Deep Field South down 
to a 5(7 limit of 42 /zJy. They found that SF galaxies 
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Fig. 21. — Distribution of the i band magnitude (Subaru where 
available, otherwise CFHT) for sources in the VLA-COSMOS 
matched radio sample (MRS). Also shown are the distributions 
for the identified sub-samples of sources: QSOs, star-forming (SF), 
AGN, and high-z galaxies 

make up only about 20% to 45% (i.e. roughly 1/3) of the 
sub-mjy radio sources. 

7.2.2. The contribution to the sub-mjy radio population 
from radio sources with no, or with flagged, optical 
counterparts 

Most of the previous studies (as well as our study up 
to this point) of the faint radio population have relied 
on sub-samples of the observed radio sources, that have 
been identified with optical counterparts out to i = 26, 
as representative o f the entire sub-m jy radio popula- 
tion. For example, iBenn et all (|1993l ) used a sample of 
only 87 out of 523 (i.e. less than 20%) 1.4 GHz radio 
sources above 0.1 mJy, for which they obtained optical 
spectroscopy (B < 22), to conclude that above 1 mJy 
about 50% of the galaxies were SF or Seyfert galax- 
ies, whil e below 1 mJy the frac tion increases to ~ 90%. 
Further, iGruppioni et alj (|1999fl studied optical spectro- 
scopic properties of 34 radio sources above 0.2 mJy in the 
Marano Field down to B = 24. This sample comprised 
~ 60% of the entire sample of faint radio sources, and 
they concluded that the SF galaxies do not constitute 
the main galaxy population of their radio sources, and 
even at sub-mjy levels the majority of their radio sources 
were identified with early type galaxies, consistent with 
AGN. Gruppioni et al. attributed t he difference in th eir 
results compared to the results from lBenn et al.l (|1993l ) to 
the fainter optical magnitude limit reached for their ra- 
dio sample. In this work we have an order of magnitude 
larger sample size (1558 sources with optical counter- 
parts), and a significantly deeper optical limit (i = 26) 
than previous studies. However, still our matched ra- 
dio source sample consists of only ~ 65% of the VLA- 
COSMOS 1.4 GHz sources. Therefore, it is important 
to investigate the contribution of the remaining ~ 35% 
of the radio sources, with no identified or flagged optical 
counterparts, to the sub-mjy population. It may indeed 
be possible that a 'missing' population of objects, that 
significantly contributes to the sub-mjy population, is 
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Fig. 22.— Top: The distribution of the 1.4 GHz total flux for 
sources in the matched radio sample (single hatched histogram), 
and for other VLA-COSMOS sources that have either no identified, 
or a photometrically flagged, optical counterpart brighter than i = 
26 (cross-hatched histogram). Bottom: The fractional distribution 
of the two samples compared to the entire sample of 1.4 GHz radio 
sources. Indicated error bars are derived from Poisson statistics. 

'hidden' in this sample. We show below that this is not 
the case. 

If the above hypothesis is true, then the properties 
(such as e.g. MIR colors) of these remaining objects are 
expected to be distinct from the properties of objects in 
the matched radio sample. In order to shed light on this, 
in Fig. [22] (top panel) we show the distribution of the 
total flux density for the 1558 sources in the matched ra- 
dio sample, and for the remaining 830 sources that were 
a) not identified with optical counterparts with i < 26 
or b) have optical counterparts with i < 26 but in pho- 
tometrically masked-out region (see Sec. 13. 1|) . The bot- 
tom panel shows the fractional contribution of these two 
samples compared to the entire VLA-COSMOS 1.4 GHz 
population. The fraction of matched radio sources is sta- 
tistically consistent to be constant at ~ 65% at all faint 
flux density levels, although formally it decreased from 
~ 75% at - 0.7 mJy to ~ 60% at the limit of the VLA- 
COSMOS survey. On the basis of these high percent- 
ages, it is unlikely that any further population, that is 
not present in our matched radio sample, could account 
for a significant fraction of the sub-mjy population. 

This is further strengthened by the distribution of the 
remaining sources in the MIR color-dolor diagram, shown 
in Fig.l23[ which is consistent with the expected distribu- 
tion for a mixed sample of star forming, AGN galaxies, 
and QSOs at all redshifts in the range from the local 
to the highest observable redshifts. An additional af- 
firmation of the mix of population arises from the 31 
sources that have been detected as X-ray point sources 
(see Fig. [23]) . It is worth noting however, that ~ 60% 
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Fig. 23.— Analogous to Fig. \W\ but for 813 VLA-COSMOS 
1.4 GHz sources (grey dots) that have either no identified, or a 
photometrically flagged, optical counterpart brighter t han i = 26, 
but that have IRAC detections (i.e. ~ 75%; see Sec. 13.41 1. Grey 
squares indicate the sources that have an optical counterpart with 
i < 26, but that is within a masked-out area. Black open squares 
represent the sources that have XMM point source detections. The 
clumping of objects in the upper left quadrant is consistent with 
both star forming, Seyfert and passive galaxies, in the redshift 
range from about 1.5 to 3, suggesting that over ~ 60% of the 
VLA-COSMOS 'remaining' sources are high redshift galaxies or 
QSOs. 

of these sources are consistent with higher redshift ob- 
jects (z > 1.3), while this is true only for ~ 50% of the 
galaxies in the matched radio sample (see Fig. [T9|) . We 
therefore conclude that the radio sources without identi- 
fied, or with flagged, optical counterparts brighter than 
i = 26 are most likely comprised of a mixture of dif- 
ferent source populations (SF, AGN, QSO), similar to, 
although on average at higher redshift then the radio 
sources in our matched radio sample. Further, the rel- 
atively small total percentage of these sources cannot 
significantly alter the results about the 'population mix' 
in the 1.4 GHz VLA-COSMOS radio sample, inferred in 
the previous section. 

7.3. Concluding remarks on the composition of the 
faint radio population 

The faint - submillijansky - radio population com- 
prises the radio population responsible for the up- 
turn of the differ ential radio source counts below 
1 mJy (see e.g. iBondi et al.l l2007f) . and it has 
been the subject of many studies, and a matter 
of gr e at debate in the past t hree decades (| Condon! 
1981 IWhrdhorst et all [19851: iGrappioni et alJ 119991: 
Seymour et alj l2004t ISimpson et al.l 12006ft . This radio 
population has been interpreted as a 'new' rising popu- 
lation of objects, that do not significantly contribute at 
higher radio flux density levels. However, results from 
studies that tried to reveal the exact composition of the 
sub-mjy population have been highly discrepant. It was 
suggested that the majority of this faint radio popula- 
tion consists of faint blue galaxies, and it was assumed 
that these galaxies are under going significant star forma- 
tio n {W indhor stTet al.l H 985). The spectroscopic study 
bv lBenn et alj ( 19931 ). although analyzing only less than 
20% of their radio sample, supported this result indi- 



cating that the fraction of SF and Seyfert galaxies rises 
from about 50% to 90% in the r ange from super- tosub- 
mJy flux densities. However, iGruppioni et alj (l999), 
who performed a deeper optical spectroscopic analysis 
of a larger fraction (~ 60%) of their radio sources, dis- 
agreed with this result identifying the majority of their 
sub-mjy radio sources with early type galaxies, in which 
the radio emission is produced by AGN. Recently, us- 
ing a combination of optical and radio morphology as an 
identifier for AGN and SF gala xies on ~ 90% of their 
radio sources in the SSA13 field, iFomalont etaTI ([2006) 
suggested that less than ~ 40% of the r adio sources be- 
low 1 mJy are AGN. On the other hand. rPadovan i~et al.l 
(12007ft based their SF/AGN classification on a combi- 
nation of optical morphologies, X-ray luminosities and 
radio-to-optical flux ratios of their radio sources in the 
CDFS (Chandra Deep Field South), and indicated that 
only about 20 — 40% of the faint radio sources are made- 
up of star forming galaxies. 

As already mentioned in the Sec. [1] there are two main 
reasons for such controversial results in the past litera- 
ture: a) The identification fraction of radio sources with 
optical counterparts spans a wide range (20% to 90%) in 
samples from radio deep fields, and b) the methods for 
the separation of SF from AGN galaxies have been highly 
heterogeneous. Further, the first and second point seem 
to exclude each other. Namely, the most efficient separa- 
tion methods (e.g. intermediate- to high- resolution op- 
tical spectroscopy) generally led to a small identification 
fraction, whereas large radio samples with high optical 
identification fractions usually lacked robust SF/AGN in- 
dicators. In this paper we have tried to make a strong 
conjunction of these two points. We introduced a new 
method to separate SF from AGN galaxies, which relies 
strongly on rest-frame color properties, and thus allowed 
us to perform a robust classification of ~ 65% (= 1558) 
of the 1.4 GHz VLA-COSMOS radio sources down to 
~ 50 /iJy with identified optical counterparts out to 
i = 26 without the need for e.g. optical spectroscopy. 
Thus, we managed to reach both a high identification 
fraction and a robust source classification. However, in 
order to avoid any possible biases, we also utilized the 
full panchromatic COSMOS data set to put constraints 
on the properties of the remaining ~ 35% of the radio 
sources that were not identified with optical counterparts 
with i < 26, or that have optical counterparts with i < 26 
but with uncertain photometry due to blending and sat- 
uration in the optical images. In short, we have obtained 
a complete view of the 'population mix' of the sub-mjy 
radio sources, using to date the largest sample of 2388 
faint (5cr « 50 fiJy) radio sources at 1.4 GHz. Our com- 
bined methods allow us to reveal the true nature of the 
sub-mjy radio sources in the VLA-COSMOS survey on 
a sound statistical basis. 

Uarvis fc Rawlingsl (|2004ft first suggested that the ob- 
served flattening of the differential radio source counts 
below 1 mJy may be caused by 'radio-quiet' AGN, i.e. 
radio-quiet QSOs and type-2 AGN (consistent with our 
'AGN' class), rather than star forming galaxies. Mak- 
ing use of the observed correlation between X-ray and 
radio luminosity and the empirically derived X-ray lumi- 
nosity function, they computed the 1.4 GHz luminosity 
function for radio-quiet AGN out to high redshifts, and 
used it as the key ingredient to model the radio source 
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count s at faint flux density levels (see I Jar vis fc Rawlingsl 
120041 and references therein). However, they did not take 
into account SF galaxies as an important population to 
explain the shape of the counts. Such an interpretation 
managed to represent the differential radio source counts 
well down to ~ 250 /iJy, but failed at fainter flux densi- 
ties. Our results, derived in the previous sections, show 
that indeed star forming galaxies are not the dominant 
population at sub-mJy levels in the range of 50 //Jy to 
0.7 Jy, however they still contribute about 30% to 40% 
at these flux densities. Based on our large sample of 2388 
VLA-COSMOS radio sources we find that our classified 
AGN, which comprise mostly of low- luminosity and type- 
2 AGN, make up 50 — 60% of the faint radio population 
with a decreasing trend towards fainter flux densities, 
while the identified QSOs, which are mostly type-1 AGN, 
form a minor contribution of ~ 10% of the sub-mJy ra- 
dio population in the range of 50 /xJy to 0.7 Jy. Thus, 
our observational results show that the 'population mix' 
in the faint radio population contains a fair contribution 
of both SF galaxies and (low-luminosity and obscured) 
AGN, at least down to 50 /iJy. Thus, separate luminos- 
ity functions for both populations have to be taken into 
account in order to fully explain the flattening of the ra- 
dio source counts below 1 mJy. The 1.4 GHz luminosity 
function for star forming and AGN galaxies at interme- 
diate redshift, as well as the modeling of the 1.4 GHz 
radio source counts using these new observational con- 
strains and further analyzes of the 'population mix' in 
the VLA-COSMOS survey, is going to be fully addressed 
and presented in a number of up-coming publications 
(ISmolcic et all 120071: ICiliegi et alj|2007bt iPaglione et all 
2007). 

8. SUMMARY 

Our newly developed rest-frame color based classifi- 
cation method, in conjunction with the VLA-COSMOS 
1.4 GHz observations, enabled us to make, for the first 
time, a thorough distinction between sources where the 
1.4 GHz radio emission is predominantly driven by star 
formation processes from those where it is driven by 
SMBH processes, regardless of the luminosity of the lat- 
ter, and apply it to currently the largest sample of 1558 
1.4 GHz sources, with optical counterparts brighter than 
*ab < 26, and complete down to ~ 50 /iJy. 

We have explored to full detail the composition of the 
sub-mJy radio population making use of the entire sam- 
ple of 2388 VLA-COSMOS radio sources detected above 
5cr (~ 50/iJy), in conjunction with the panchromatic (X- 
ray to radio) COSMOS data set. We find that SF galax- 
ies are not the dominant population at submillijansky 
flux density levels, as believed previously, but that they 
rather make up an approximately constant fraction of 
30 — 40% in the flux density range of ~ 50 //Jy to 0.7 mJy. 
The radio population at these flux densities is a mixture 
of roughly 30 - 40% of SF and 50 - 60% of AGN galaxies, 
with a minor contribution (~ 10%) of QSOs. 
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TABLE 1 

MULTI- WAVELENGTH CROSS-CORRELATION OF VLA-COSMOS 1.4 GHz RADIO SOURCES 





total 


■^spec 


IRAC 1 


MIPS 2 


XMM 3 


total radio sample 


2388 


520 


2058 


1117 


210 


matched radio sample" 


1558 


447 


1448 


799 


179 


stars 


2 





2 


2 





QSOs 


139 


31 


122 


78 


43 


SF galaxies 1 


340 


150 


322 


280 


16 


AGN galaxies' 


601 


262 


579 


267 


98 


high-z galaxies 11 


476 


4 


423 


172 


22 


remaining radio sample 


830 


73 


610 


318 


31 



1 Detected in the Spitzer/IRAC 3.6 fim band. 2 Detected in the Spitzer/MIPS 24 /an band within the shallow MIPS COSMOS 
survey at or above a signal to noise of 3. 3 X-ray point-sources associated with optical counterparts as described in lBrusa et al.l 
(2007a) * Radio source sample positionally matched to optical sources with AB i < 26 and outside masked-out regions in the NUV- 



NIR photometric catalog described bv lCapak et al.l J2007I) . 1 Galaxies at redshift < 1.3. 11 Galaxies at redshift > 1.3. ** Radio 
sources a) without optical counterparts with i < 26 (~ 70%), or b) with optical counterparts (i < 26) that lie in masked-out areas 
in the photometric catalog (~ 30%). 



APPENDIX 

THE REST-FRAME COLORS PI AND P2 

The rest-frame colors PI and P2 are a a linear superposi tion of colors in the wavelength range of 3500 - 5800 A, 
obtained using the modified Stromgren photometric system (|Odell et al.ll2002l ISmolcic et al.ll2006l the filter response 
curves are available at http://www.mpia-hd.mpg.de/COSMOS). PI, P2 arc principal component axes optimally 
characterizing the galaxy locus in 2D colo r-color s pace. PI measures the position along the galaxy locus, and P2 the 
position perpendicular to it (see Fig. 4 in ISmolcic et al.ll2006f) . As the galaxy locus is slightly curved, the functional 
form of the rest-frame colors is given separately for the blue and red ends, with a boundary at vz — yz — 0.646. Hence, 
for galaxies with vz — yz < 0.646 (P1,P2) are given as: 

PI = 0.911 (ci - 0.646) + 0.412 (c 2 - 0.261) (Al) 

P2 = -0.412 (ci - 0.646) + 0.911 (c 2 - 0.261), (A2) 

and for galaxies with vz — yz > 0.646 as: 

Pl = 0.952 (ci- 0.646) + 0.307 (c 2 - 0.261) (A3) 

P2 = -0.307 (ci - 0.646) + 0.952 (c 2 - 0.261), (A4) 

where c\ — vz — yz and c 2 = bz — yz. 

DERIVATION OF COMPLETENESS AND CONTAMINATION OF THE PHOTOMETRICALLY SELECTED SAMPLES OF 

STAR FORMING GALAXIES AND AGN 

SDSS/NVSS galaxy sample 

Given both the available spectroscopy and photometry of the local galaxy sample (SDSS/NVSS) we can easily access 
the completeness and contamination of the photometrically selected samples of SF and AGN galaxies as follows. In 
Fig. [24] we show the differential and cumulative distributions of PI for the SDSS/NVSS star forming, composite and 
AGN galaxies. Here the classification is based on the BPT diagram for the galaxies with emission lines in their spectra, 
while all galaxies without emission lines are included in the AGN class. These 'absorption line AGN' constitute the 
major fraction of all the AGN (~ 80%) in the sample. The top panel shows the PI histograms for these three types 
of objects, normalized by the total number of objects. In the middle panels in Fig. [24] we show the fractions for the 
SF, AGN and composite galaxies for the selection of SF (left panel) and AGN (right panel) galaxies. The fractions 
were computed in such a way that for each PI value (Plj) the distributions were normalized to the total number of 
galaxies in the sample with PI < Plj (for SF galaxies; left panel), or PI > Plj (for AGN galaxies; right panel). In 
this way, we obtain the fraction of SF, AGN and composite galaxies within the full sample that was selected only by 
applying a PI color-cut. Further, in the bottom panels in Fig. [M] we show the cumulative distributions of the SF (left 
panel) and AGN (right panel) galaxies, scaled to the total number of SF and AGN galaxies, respectively, thus showing 
their completeness as a function of PI. Selecting galaxies with Pl<0.15, and defining them as the 'photometrically 
selected sample of star forming galaxies', leads to a sample that contains ~ 20% AGN, ~ 10% composite objects, and 
~ 70% 'real' (i.e. spectroscopically identified) SF galaxies (see middle left panel in Fig. |2"4"]) . The latter make-up ~ 85% 
of all 'real' SF galaxies (see bottom left panel in Fig. 124]) . On the other hand, using the color cut to generate the 
'photometrically selected AGN galaxy sample' (Pl>0.15) leads to a sample that contains ~ 5% SF galaxies, ~ 15% 
composite objects, and ~ 80% 'real' AGN galaxies (see middle right panel in Fig. |2"4"]), where the latter make-up ~ 90% 
of all 'real' AGN galaxies (see bottom right panel in Fig. [24]) . 
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Fig. 24. — Differential and cumulative distributions of the rest-frame color PI for ~ 7, 000 SDSS galaxies, in the redshift range of 
0.01 to 0.3, drawn from the DR1 "main" spectroscopic sample, that have 1.4 GHz NVSS detections. The galaxies were spectroscopically 
classified as star forming (SF; thick line), AGN (dashed line) or composite (dot-dashed line; see text for details). The top panel shows the 
PI histograms for these three types of objects. The middle left panel shows the fraction of these three types of galaxies as a function of 
PI. These distributions are normalized in such a way that for each PI value the fraction of the three types of galaxies within the total 
selected sample can be read off. The bottom left panel shows the cumulative distribution of PI for SF galaxies, thus showing to which 
completeness the 'real' (i.e. spectroscopically classified) SF galaxies are selected for any given PI color-cut. The bottom and middle right 
panels are analogous to the bottom and middle left panels, but for AGN galaxies. Note also that the cumulative distributions for AGN 
were computed as a function of decreasing PI. The vertical (thin solid) lines in the middle and bottom panels designate the value of PI, 
chosen to separate SF galaxies from AGN. 

SDSS/NVSS/IRAS galaxy sample 

The BPT diagram for ~ 830 SDSS/NVSS/IRAS emission line galaxies is shown in Fig. [25l color-coded using the 
(P1,P2) plane. A strong correlation between PI and log ([Nil 6584]/Ha), very similar to the correlation observed for 
the full sample, is discernible. 

The differential distribution of PI for the complete sample of 875 SDSS/NVSS/IRAS galaxies is shown in the top 
panel of Fig. [2H The galaxies were spectroscopically separated into AGN, star-forming and composite galaxies (as 
described in Sec. I4.1.ip . In the middle panel we show the corresponding fraction (analogous to the middle left panel 
in Fig. HM)) . Using a rest-frame PI color cut-off of 0.15 the selected sample of star forming galaxies, detected in the 
IR regime, contains ~ 70% 'real' (i.e. spectroscopically identified) SF galaxies (that make up ~ 85% of all 'real' SF 
galaxies; see bottom panel), 10% AGN, and 20% composite objects. This is fairly consistent with the properties of 
the entire SDSS/NVSS sample. 

In order to assess the fraction of dusty starburst galaxies that we omit using the rest-frame color selection, in 
the bottom panel of Fig. [26] we show the cumulative distributions of the spectroscopically identified star forming 
galaxies as a function of PI for i) all star forming galaxies, ii) luminous IR galaxies (LIRGs; Lir = 10 11-12 L©) 
and iii) ultra-luminous IR galaxies (ULIRGs, Lir, > 10 12 L Q ). [The total IR luminosities were computed following 
iSanders fc Mir abel (1996).] A PI cut of 0.15 misses only ~ 10% and ~ 5% of luminous and ultra-luminous starburst 
galaxies, respectively. 
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Fig. 25.— Analogous to Fig. H] but for ~ 830 SDSS/NVSS/IRAS emission line galaxies. 
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Fig. 26. — Differential and cumulative distributions of the rest-frame color PI for 875 SDSS galaxies from the DR1 "main" spectroscopic 
sample, that have NVSS and IRAS detections. The galaxies were spectroscopically classified as star forming (SF; thick line), AGN (dashed 
line) or composite (dot-dashed line; see text for details), and their PI histograms are shown in the top panel. The middle panel shows 
the fraction of PI for SF, AGN and composite galaxies, analogous to the middle left panel in Fig. 1241 The vertical thin solid line shows 
the chosen PI color cut-off. The bottom panel shows the cumulative distribution of PI, normalized to SF galaxy counts (thick line). Also 
shown are the PI distributions of SF galaxies with IR luminosities greater than 10 11 Lq and 10 12 Lq consistent with LIRGs and ULIRGs, 
respectively. Note that our rest-frame color based classification method is not strongly biased against dusty starburst galaxies. 



